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Disclosed are new methods comprising the use of in situ hybridization to detect abnormal nucleic add sequence copy num- 
bers in one or more genomes wherein repetitive sequences that bind to multiple lod in a reference chromosome spred are either 
substantially removed and/or their hybridization signals suppressed. The invention termed Comparative Genomic Hybridization 
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example, a normal human genome). The intensity(ies) of the signals from each labeled subject nucleic add and/or the differences 
in the ratios between different signals from the labeled subject nucleic add sequences are compared to determine the relative copy 
numbers of the nucleic add sequences in the one or more subject genomes as a function of position along the reference chromo- 
some spread. Amplifications, duplications and/or deletions in the subject genome(s) can be deteaed. Also provided is a method 
of determining the absolute copy numbers of substantially all RNA or DNA sequences in subject cell(s) or cell population(s). 
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COMPARATIVE GENOMIC HYBRIDIZATION fCGH^ 
FIELD OF THE INVENTION 
This invention relates generally to the field of 
cytogenetics, and more particularly to the field of molecular 
5 cytogenetics* It concerns methods of detenaining the relative 
copy numbers of different nucleic acid sequences in a subject 
cell or cell population and/or comparing the nucleic acid 
sec[uence copy numbers of substantially identical secpiences in 
several cells or cell populations as a function of the 

10 location of those sequences in a reference genome. For 

instance, the methods of this invention provide the means to 
determine the relative nvimber of copies of nucleic acid 
sequences in one or more siibject genomes (for example, the DNA 
of one tumor cell or a number of cells from a subregion of a 

15 solid tumor) or portions thereof as a function of the location 
of those sequences in a reference genome (for example, a 
normal humzm metaphase spread) . Further, the invention 
provides methods of determining the absolute copy mnnber of 
nucleic acid sequences in a subject cell or cell population. 

20 Although the examples herein concern hvunem cells and 

the language is primarily directed to human concerns, the 
concept of this invention is applicable to genomes from any 
plant or aniiaal. The genomes compared need only be related 
closely enough, to have sufficient substantially identical 

25 secpiences for a meeiningful analysis. For example, a humam 

genome and that of another primate could be compared according 
to the methods of this invention. 

BACKGROUND OF THE INVENTION 

30 chromosome aUbnormalities eure associated with genetic 

disorders, degenerative diseases, and exposure to agents known 
to cause degenerative diseases, particularly cancer, German, 
"Studying Human Chromosomes Today," American Scientist, SB! 
182-201 (1970); Yunis; "The Chromosomal Basis of Hunan 

35 Neoplasia," Science. 221 ; 227-236 (1983); and German, 
"Clinical Implication of Chromosom Breakage," in Genetic 
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Damaga in Man Cause d bv Environmental Agents. Berg, Ed., pgs. 
65-86 (Academic Press, New York, 1979). Chromosomal 
abnormalities can be of several types, including: extra or 
missing individual chromosomes, extra or missing portions of a 
5 chromosome (segmental duplications or deletions) , breaks, 
rings and chromosomal rearrangements, among others. 
Chromosomal or genetic rearremgements include translocations 
(transfer of a piece from one chromosome onto another 
chromosome) , dicentrics (chromosomes with two centromeres) , 

10 inversions (reversal in polarity of a chromosomal segment) , 
insertions, amplifications, and deletions. 

Detectable chromosomal 2d3normalities occtir with a 
frequency of one in every 250 humem births. Abnormalities 
that involve deletions or additions of chromosomal material 

15 alter the gene balance of an orgcuiism and generally lead to 
fetal death or to serious mental and physical defects. Down 
syndrome can be caused by having three copies of chromosome 21 
instead of the normal 2. This syndrome is an example of a 
condition caused by abnormal chromosome number, or aneuploidy. 

20 Down syndrome can also be caused by a segmental duplication of 
a subregion on chromosome 21 (such as, 21g22), which can be 
present on chromosome 21 or on emother chromosome. Edward 
syndrome (18+), Patau syndrome (13+), Turner syndrome (XO) and 
Kleinfelter syndrome (XXY) are among the most common numerical 

25 aberrations. [Epstein, The consequences of Chromosome 

Tmbalanceg Principles, Mechanisms and Models (Cambridge Univ. 
Press 1986); Jacobs, Am, J, Epi demiol, 105; 180 (1977); and 
Lubs et al.. Science, 169 ; 495 (1970).] 

Retinoblastoma (del I3ql4) , Prader-Willis syndrome 

30 (del 15qll- ql3) , Wilm's tumor (del llpl3) and Cri-du-chat 
syndrome (del 5p) are examples of important disease linked 
structural aberrations. [Nora and Fraser, Medical Genetics: 
Principles and Practice, (Lea and Febiger (1989).] 

One of the critical endeavors in human medical 
35 research is the discovery of genetic abnormalities that are 
c ntraJ. to adverse health consequences. In many cases, clues 
to the location of specific genes and/ or critical diagnostic 
markers com from identification of portions of the genome 
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that are present at abnormal copy numbers. For example, in 
prenatal diagnosis, as indicated above, extra or missing 
copies of whole chromosomes are the most frequently occxirring 
genetic lesion. In cancer, deletion or multiplication of 
3 copies of whole chromo£<omes or chromosomal segments, and 

higher level amplifications of specific regions of the genome, 
are common occurrences. 

Much of such cytogenetic information has come over 
the last several decades from studies of chromosomes with 

10 light microscopy. For the past thirty years cytogeneticists 
have studied chromosomes in malignant cells to determine sites 
of recurrent abnormality to gleeui hints to the location of 
critical genes. Even though cytogenetic resolution is limited 
to several megabases by the con^lex packing of DNA into the 

15 chromosomes, this effort has yielded crucial information. 
Among the strengths of such traditional cytogenetics is the 
ability to give an overview of em entire genome at one time, 
permitting recognition of structxiral abnormalities such as 
inversions and translocations, as well as deletions, 

20 multiplications, and £unplificatlons of whole chromosomes or 
portions thereof. With the coming of cloning and detailed 
molecular analysis, recurrent translocation sites have been 
recognized as involved in the formation of chimeric genes such 
as the BCR-ABL fusion in chronic myelogeneous leukemia (CHL) ; 

25 deletions have been recognized as frequently indicating the 
location of tumor suppressor genes; and amplifications have 
been recognized as indicating overexpressed genes. 

Conventional procedures for genetic screening and 
biological dosimetry involve the analysis of keuryotypes. A 

30 karyotype is the peurticular chromosome complement of an 

individual or of a related group of individuals, as defined 
both by the number and morphology of the chromosomes usually 
in mitotic metaphase. It include such things as total 
chromosome number, copy ntimber of individual chromosome types 

35 (e.g., the numb r of copies of chrom s me X) , and chromosomal 
morphology, e.g., as measured by length, centr meric index, 
connect dn ss, or the lik . Karyotypes are conventionally 
determined by chemically staining an organism's m taphase, 
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prophase or otherwise condensed (for example, by premature 
chromosome condensation) chromosomes* Condensed chromosomes 
are used because, until recently, it has not been possible to * 
visualize interphase chromosomes due to their dispersed 
5 condition and the lack of visible boundaries between them in * 
the cell nucleus. 

A number of cytological techniques based upon 
chemical stains have been developed which produce longitudinal 
patterns on condensed chromosomes, generally referred to as 
10 bands. The banding pattern of each chromosome within an 
orgamism usually permits unambiguous identification of each 
chromosome type [Latt, "Optical Studies of Metaphase 
Chromosome Organization," Annual Review of Biophysics and 
Btoenaineerina. 5 ; 1-37 (1976)]. 
15 Unfortunately, such conventional banding analysis 

requires cell culturing and preparation of high quality 
metaphase spreads, which is time consuming and leibor 
intensive, and frequently difficult or impossible* For 
example, cells from many tumor types are difficult to culture, 
20 and it is not clear that the cultiired cells are representative 
of the original timer cell population. Fetal cells capable of 
being cultured, need to be cultured for several weeks to 
obtain enough metaphase cells for analysis. 

Over the past decade, methods of in situ 
25 hybridization have been developed that permit analysis of 
intact cell nuclei — interphase cytogenetics. Probes for 
chromosome centromeres, whole chromosomes, and chromosomal 
segments down to the size of genes, have been developed. With 
the use of such probes, the presence or absence of specific 
30 eUbnormalities can be very efficiently determined; however, it 
is tedious to test for numerous possible abnormalities or to 
survey to discover new regions of the genome that are altered - 
in a disease. 

The present invention. Comparative Genomic 
35 Hybridization (CGH) [formerly call d Copy Rati Revers 
Cytogenetics (CRRC) among other names] provides powerful 
methods to overcome many of the limitations of existing 
cytogenetic techniques. When CGH is applied, for example, in 
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the fields of tumor cytogenetics and prenatal diagnosis, it 
provides methods to determine whether there are abnormal copy 
numbers of nucleic acid sequences anywhere in the genome of a 
subject tximor cell or fetal cell or the genomes from 
5 representative cells from a tumor cell pc^pulation or from a 
number of fetal cells, without having to prepare condensed 
chromosome spreads from those cells. Thus, cytogenetic 
eibnormalities involving abnormal copy numbers of nucleic acid 
sequences, specifically amplifications and/ or deletions, can 

10 be found by , the methods of this invention in the format of an 
immediate overview of an entire genome or portions thereof. 
More specifically, CGH provides methods to compare and map the 
frequency of nucleic acid sequences from one or more subject 
genomes or portions thereof in relation to a reference genome. 

15 It permits the determination of the relative number of copies 
of nucleic acid sequences in one or more subject genomes (for 
example, those of tumor cells) as a function of the location 
of those sequences in a reference genome (for example, that of 
a normal human cell) • 

20 Gene amplification is one of several mechanisms 

whereby cells can change phenotypic expression when increased 
amounts of specific proteins are required, for example, during 
development [Spradling and Mahowald, PWAS rusAK 77; 1096- 
1100 (1980); Glover et al., PNAS ruSA^ , 79; 2947-2951 

25 (1982)], or during an environmental challenge When increased 
amoxints of specific proteins can impart resistance to 
cytotoxic agents [Melera et al., J> Biol, Chem. 255; 7024- 
7028 (1980); Beach and Palmiter, PNAS fUSA. 78 ; 2110-2114 
(1981)]. 

30 A major limitation of Southern emalysis and related 

conventional techniques for walysis of gene eunplif Ication is 
that only specific sites are studied leaving the vast majority 
of the genome unexamined. Conventional cytogenetic studies, 
on the other hand^ provide a broad survey of the genome but 

35 provide little information about gen s that may be involved in 
amplification events. However, the procedures of this 
invention ov rcome those limitations. This inventi n can be 
used to show the n rmal chromosomal locati ns of all regions 
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of a genome that are amplified or deleted wherein the size f 
the regions that can be detected is limited only by the 
resolution of the microscopy used and the orgsuiization of DNA 
in condensed chromosomes. Thus, this invention provides among 
5 other uses the ability to study gene amplifications and 
deletions and their roles in tumor development, progression 
and response to therapy more thoroughly than was possible 
previously. The methods of CGH are sufficiently rapid and 
simple that large niimbers of subject nucleic acids, for 

10 exsuaple from many tumors, can be analysed in studies for gene 
amplification and deletion. 

The karyotypic heterogeneity in solid tumors can be 
extreme. Identification of commonly occurring chromosomal 
changes by smalysis of metaphase spreads is often difficult or 

15 impossible using conventional bsmding analysis because of the 
complexity of the rearremgements and because of the poor 
quality of the metaphase preparations. CGH overcomes that 
limitation in that the tumor nucleic acid can be studied 
without the requirement of preparing metaphase spreads. Since 

20 CGH can probably be performed on single cells by amplifying 
the nucleic acid therefrom, CGH can be used to investigate the 
heterogeneity of tumors by studying representative cells from 
different cell populations of the tumor. Alternatively, CGH 
of nucleic acid from a tumor extracted in a bulk extraction 

25 process from many cells of the tumor can reveal consistencies 
within the apparent heterogeneity. For example, the same 
eunplif led sequences may appear as homogeneoiisly staining 
regions (HSRs) and/or double minute chromosomes (DHs) in one 
tumor cell but as an extension of a chromosome arm in another 

30 txudor cell. Thus, order from the appeurent randomness may be 
realized by the use of CGH. 

Montgomery et al., PNAS rusAK aor 5724-5728 
(September 1983), concerns the hybridization of labeled Cot 
fractionated DNAs from tumor cell lines (a Cot fraction from 

35 which the high copy repeats, low copy r peats and single copy 
sequences were substsmtially removed) to metaphase spreads 
from said tumor cell lines. Basically, M ntgom ry et al. 
mapped the positions of nucleic acid sequ nces from tumor c 11 
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lines that are very highly amplified back to tumor cell line 
genomes . 

Total genomic DNA from one species has been used in 
in situ hybridization to discriminate in hybrid cells between 
5 chromosomes of that species and of a different species on tl*3 
basis of the signal from the high copy repetitive sequences. 
[Pinkel et al., pnas fUSA^ . 83 ; 2934 (1986); Manuelidis, Hum. 
Genets, 71 ; 288 (1985); and Dumam et al., Somatic Cell 
Molec, Genet., 11; 571 (1985).] 

10 Landegent et al., Hum. Genet. .77 ; 366-370 (1987), 

eliminated highly repetitive sequences, like Alu and Kpn 
fragments, from whole cosmid cloned genomic sequences by 
blocking the highly repetitive sequences with Cot-1 DNA. The 
resulting probe was used for in situ hybridization. 

15 European Patent Application Publication No. 430,402 

(published Jxine 5, 1991) describes methods and compositions 
for chromosome-specific painting, that is, methods and 
compositions for, staining chromosomes based upon nucleic acid 
sequence employing high complexity nucleic acid probes. In 

20 general in the chromosome-specific painting methods, 

repetitive sequences not specific to the targeted nucleic acid 
sequences are removed from the hybridization mixture and/ or 
their hybridization capacity disabled, often by blocking with 
unlabeled genomic DNA or with DNA enriched for high copy 

25 repetitive sequences as is Cot-1 [comonercially available from 
Bethesda Research Laboratory, Gaithersbxirg, MD (USA) ] . Pinkel 
et al., PNAS (USA), £5: 9138-9142 (1988) also describes 
aspects of chromosome-specific painting as well as 
International Publication No. WO 90/05789 (published May 31, 

30 1990 entitled ** ln Situ Suppression Hybridization and Uses 
Therefor") . 

Chromosome-specific repeat sequence probes and 
chromosome-specific painting probes can be hybridized in situ 
to interphase nuclei as well as metaphase spreads and provide 
35 information about the genetic state of the individual teurgeted 
genomes. A limitation of such hybridizati ns is that 
cytog netic information is only pr vided from the regions to 
which the probes bind. Such hybridizations ar very useful 
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for determining if a particular abnormality is present, f r 
example, the deletion of a specific gene or a duplication 
among other abnormalities, but it is laborious to search for 
currently unknown abnormalities on a region by region basis. 

5 Other methods of searching for unknown genetic 

abnormalities similarly require a lot of work. For example, 
looking for loss of heterozygosity in tumor cells, requires 
the hybridization of many probes to Southern blots of tumor 
and normal cell DNA. CGH provides methods to overcome many of . 

10 the limitations of the existing cytogenetic techniques. 

Saint «Ruf et al.. Genes. Chromosomes & Cancer. 2: 
18-26 (1990) concluded from their studies of breast cancer 
that although amplification of genetic material is a frequent 
and probably importsmt event in breast carcinogenesis, that 

15 the relevant genes involved in such amplifications remain 
unknown but do not seem to correspond to the proto-oncogenes 
commonly considered important in breast cancer. 

Since HSRs in tumors are most often not at the site 
where the vmamplified gene is in normal cells, standard 

20 cytogenetics does not yield any information that could assist 
with identification of the gene(s). CGH on the other hand 
permits mapping them in the normal genome, a major step 
toweords their identification. 

Dutrillaux et al., Cancer Genet. Cvt oaenet. . 49; 

25 203-217 (1990) report (at page 203) that "[ajlthough human 
breast carcinomas are among the most frecpient malignant 
ttimors, cytogenetic data remain scarce, probably because of 
their great variability and of the frequent difficulty of 
their analysis." In their study of "30 cases with relatively 

30 single karyotypes to determine which emomalles occur the most 
frequently euid, in particular, early during tumor progression" 
(p. 203), they concluded that "trisomy Iq and monosomy 16q are 
early chromosomal changes in breast cancer, whereas other 
deletions and gain of 8q aure cleeurly secondary events." 

35 [ Abstract , p. 203.] Dutrillaux t al. further stat (at page 
216) that deletions within tumor suppressor genes 
"characterize tumor progression of breast c€mcer." 
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It is believed that many s lid tumors, such as 
breast cancer, progress from initiation to metastasis through 
the accumulation of several genetic aberrations. [Smith et 
al., Br?^$^ ganP^y R^^T Tr^atti 19 Suppl. l: S 5-14 (1991); 
5 van de Vijver and Nusse, Biochim> Biophvs. A cta. 1072 ; 33-50 
(1991); Sato et al,, Cancer Res. , 50 ; 7184-7189 (1990).] 
Such genetic aberrations, as they accumulate, may confer 
proliferative advamtages, genetic instability and the 
attendant ability to evolve drug resistance rapidly, and 

10 enhanced angiogenesis, proteolysis and metastasis. The 
genetic aberrations may affect either recessive **tumor . 
suppressor genes" or dominant ly acting oncogenes. Deletions 
and recombination leading to loss of heterozygosity (LOH) are 
believed to play a major role in tumor progression by 

15 uncoverin" mutated tumor suppressor alleles. 

Dominemtly acting genes associated with human solid 
tumors typically exert their effect by overexpression or 
altered expression. Gene amplification is a common mechanism 
leading to upregulation of gene expression. [Stark et al., 

20 Cell, 75 ; 901-908 (1989).] Evidence from cytogenetic studies 
indicates that significant euaplif ication occurs in over 50% of 
human breast cancers. [Saint-Ruf et al., supra. ] A veuriety 
of oncogenes have been found to be amplified in human 
malignancies. . Examples of the amplification of celluleu: 

25 oncogenes in humeui tumors is shown in Table l below. 
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Amplified 
Gene 



Tumor 



Degree of 
Amplification 



DM or HSR 
Present 



c-myc Promyelocytic leukemia 2 Ox 

5 cell line, HL60 

Small-cell lung 5 -3 Ox 

carcinoma cell lines 
N-myc Primary nexxroblastomas 5-lOOOx 
(stages III and IV) 
10 and neuroblastoma 

cell lines 
Retinoblastoma cell 10-2 OOx 

line and primcury 
tumors 

15 Small-cell lung carcinoma 50x 

cell lines and tumors 
L-myc Small-cell lung carcinoma 10-2 Ox 

cell lines and tumors 
c-myb Acute myeloid leukemia 5-lOx 
20 Colon carcinoma cell lines lOx 

c-erbB Epidermoid carcinoma cell 3 Ox 

Primeory gliomas 
c-K-ras-2 Primary caurcinomas of Ixuig, 4-20x 
colon, bladder, emd 
25 rectum 

N-ras Mammary ceurcinoma cell 5*10x 
line 



7 
7 



30 SOURCE: 



modified from Varmus, Ann, Rev, Genetics. 18: 553- 
612 (1984) [cited in Watson et al.. Molecular 
Bioloay of the Gene (4th ed. ; Benjamin/ Cummings 
Publishing Co. 1987)] 



35 Chromosomal deletions involving tumor suppressor 

genes may play an important role in the development and 
progression of solid tumors. The retinoblastoma tumor 
suppressor gene (Rb-1), located in chromosome 13ql4, is the 
most extensively cheuracterized tumor suppressor gene [Friend 

40 et al., mature, 32?; 643 (1986); Lee et al.. Science, 235 : 
1394 (1987); Fung et al. , Science. 236 : 1657 (1987)]. The 
Rb-1 gene product, a 105 kDa nuclear phosphoprotein, 
apparently plays an important role in cell cycl regulation 
[Lee et al. , suora (1987); H w et al^ PNAS (USA), fil: 5883 

45 (1990)]. Altered or lost expr ssion of the Rb protein is 
caused by inactivation of both gene alleles either through a 
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p int mutation or a chromos mal deletion. Rb**l gene 
alterations have been found to be present not only in 
retinoblastomas [Friend et al., supra (1986); Lee et al. , 
supra (1987); Fung et al., supra (1987)] but also in other 
3 malignancies such as osteosarcomas [Friend et al,, supra 
(1986)], small cell l\mg cancer [Hensel et al., Cancer Res> . 
50 ; 3067 (1990); Rygaard et al.. Cancer Res. . 50 : 5312 
(1990)] and breast cancer [Lee et al., Sqj^nce, 241: 218 
(1988); T'Ang et al., Spience, 24?; 263 (1988); Varley et 

10 al., Oncogene > 4 : 725 (1989)]. Restriction fragment length 
polymorphism (RFLP) studies have indicated that such tumor 
types have frequently lost heterozygosity at 13g suggesting 
that one of the Rb-1 gene alleles has been lost due to a gross 
chromosomal deletion [Bowcock et al.. Am, J, Hum. Genet. , 46; 

15 12. (1990)]. 

The deletion of the short arm of chromosome 3 has 
been associated with several cancers, for example, small cell 
lung cancer, renal and ovarian cemcers; it has been postulated 
that one or more putative tumor suppressor genes is or are 

20 located in the p region of chromosome 3 (ch. 3p) [Minna et 
al., Svmposia on Quantitative Biology . Vol. LI: 843-853 (SCH 
Lab 1986); Cohen et al., N. Eng. J. Med.. 301 ; 592-595 
(1979); Bergerham et al.. Cancer Res. . 49 ; 1390-1396 (1989); 
Whang-Peng et al.. Can. Genet. Cvtogenet. , 11; 91-106 (1984; 

25 and Trent et al.. Can. Genet. Cvtogenet.. 14 ; 153-161 
(1985)]. 

The above-indicated collection of €uaplif ied and 
deleted genes is far from complete. As the Saint-Ruf et al. 
study ( supra ^ of oncogene amplification in cells showing 

30 cytogenetic evidence of amplification, such as double minutes 
(DMs) or homogeneously staining regions (HSRs) , indicated, th 
amplified genes were not Icnown oncogenes in most cases. As 
Dutrillaux et al., supra indicated, ""cytogenetic data remains 
scarce" for "the most frequent malignant tumors" — breast 

35 ceurcinomas • 

Discov ry f genetic changes inv Ived in the 
development of solid tumors has pr ven difficult. Karyotyping 
is impeded by the low yield of high quality metaphases and the 
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complex natTire of chromosoiaal chsmges [Teyssier , J.R. , Cancer 
Genet , Cytoaenet: , . 37 : 103 (1989)]. Although mo leculcu: 
genetic studies of isolated tumor DNA have been more 
successful and permitted detection of common regions of 
5 allelic loss^ mutation or amplification [Fearon et al.. Cell. 
61: 759 (1990); Sato et al. , Cancer Res, , 50: 7184 (1990); 
Alitalo et al. , Adv. Cance r Res>, 47: 235 (1986); and Schwab 
and Amler, Genes Chrom. Cancer. . 1 : 181 (1990)], such 
moleculaur methods are highly focused, targeting one specific 

10 gene or chromosome region at a time, and leaving the majority 
of the genome unexamined. 

Thus, a research tool leading to the identification 
of amplified and deleted genes and providing more cytogenetic 
data regarding tumors, especially tumor progression euid 

15 invasiveness is needed in ttamor cytogenetics. CGH provides 
such a molecular cytogenetic research tool. 

The ed^ility to survey the whole genome in a single 
hybridization is a distinct advantage over allelic loss 
studies by restriction fragment length polymorphism (RFLP) 

20 that target only one locus at a time. RFLP is also restricted 
by the availability and informativeness of polymorphic probes. 

CGH facilitates the genetic analysis of tumors in 
that it provides a copy number karyotype of the entire genome 
in a single step. Regions of tumor DNA gain and loss are 

25 mapped directly onto normal chromosomes. Comparisons of 
primaury tumors with their metastases by CGH should be 
informative concerning cancer progression. Analogously, other 
genomes other than those of tumors can be studied by CGH. 

30 SUMMARY OF THE IKVENTION 

Comparative Genomic Hybridization (CGB) employs the 
kinetics of In situ hybridization to compare the copy numbers 
of different DNA or SNA sequences from a sample, or the copy 
numbers of different DNA or RNA sequences in one sample to the 
35 copy nvonbers of th substantially identical sequences in 

another sample. In many useful applications of CGH, the DNA 
or RNA is isolated from a subject c 11 or cell population. 
The compeurisons can be qualitative or quantitativ • 
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Procedures are described that permit determination of the 
absolute copy numbers of DNA sequences throughout the genome 
of a cell or cell population if the absolute copy number is 
known or determined for one or several sequences. The 
5 different sequences are discriminated from each other by the 
different locations of their binding sites when hybridized to 
a reference genome, usually metaphase chromosomes but in 
certain cases interphase nuclei. The copy number information 
originates from comparisons of the intensities of the 

10 hybridization signals among the different locations on the 
' reference genome. 

Two representative basic approaches are employed in 
CGH as illustrated herein for the analysis of subject DNAs. 
In an example of the first approach, genomic DNA from a 

15 subject cell or cell population of cells is isolated, labeled 
and hybridized to reference chromosomes, usually in metaphase. 
In an example of the second approach, genomic ONAs from two or 
more subject cells or cell populations are isolated, 
differentially labeled, and hybridized to reference 

20 chromosomes, usually in metaphase. 

The CGH methods of this invention can be qualitative 
and/or quantitative. A particular utility of CGH is for 
analysing DNA sequences from subject cell(s) or cell 
population (s) , for example from clinical specimens including 

25 tumor and fetal tissues. 

An important utility of CGH is to find regions in 
normal genomes which when altered in sequence copy number 
contribute to disease, as for example, cancer or birth 
defects. For example, regions at elevated copy number may 

30 contain oncogenes, and regions present at decreased copy 
nximber may contain tumor suppressor genes. 

A representative CGH method is for comparing copy 
numbers of different DNA sequences in a subject cell or cell 
population comprising the steps of: 

35 a) extracting the DNA from the subject cell r from 

a number of cells of the subject cell p pulation; 

b) eunplifying said xtracted subject DNA, if 

necessary; 
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c) labeling the subject DNA; 

d) hybridizing said labeled subject DNA in situ to 
reference metaphase chromosomes after substantially removing 
from the labeled DNA those repetitive sec[uences that could 

5 bind to multiple >loci in the reference metaphase chromosomes, 
and/or after blocking the binding sites for those repetitive 
sequences in the reference metaphase chromosomes by 
prehybridization with appropriate blocking nucleic acids, 
and/ or blocking those repetitive sequences in the labeled DNA 

10 by prehybridization with appropriate blocking nucleic acid 
sequences, and/or including such blocking nucleic acid 
sequences for said repetitive sequences dxiring said 
hybridization, wherein the DNA sequences in the l2Q)eled 
subject DNA that bind to single copy sequences in the 

15 reference metaphase chromosomes eure substantially retained, 
and those single copy DNA sequences as well as their binding 
sites in the reference metaphase chromosomes remain' 
substantially unblocked both before and during the 
hybridization; 

20 e) rendering the bound, labeled DNA sequences 

visualizable, if necesseury; 

f ) observing and/or measuring the intensity of the 
signal from the labeled subject DNA sequences as a function of 
position on the reference metaphase chromosomes; and 

25 g) comparing the copy numbers of different DNA 

sequences of the subject DNA by comparing the signal 
intensities at different positions on the reference metaphase 
chromosomes, herein the greater the signal intensity at a 
given position, the greater the copy number of the sequences 

30 in the subject DNA that bind at that position. An analogous 
method can be performed wherein the subject nucleic acid is 
SNA. 

Further, disclosed are methods whrein two or more 
subject nucleic acids are emalysed by CGH. Exemplary methods 
35 are those wherein the stibject nucleic acids are DNA sequenc s 
from a subject c 11 or cell population. Analogous methods may 
be performed wherein the subject nucleic acids are RNA. Such 
an exemplary method is that for compauring copy numbers of 
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different DNA sequenc s in one subject cell or cell population 
relative to copy numbers of substantially identical sequences 
in another cell or cell population, said method comprising the 
steps of: 

5 a) extracting tL^^ DNA from both of the subject cells 

or cell populations; 

b) amplifying said extracted subject DNAs, if 

necesscury; 

c) differentially labeling the subject DNAs; 

10 d) hybridizing said differentially labeled siibject 

DNAs in situ to reference metaphase chromosomes after 
substantially removing from the labeled DNAs those repetitive 
sequences that could bind to multiple loci in the reference 
metaphase chromosomes, and/ or after blocking the binding sites 

15 for those repetitive sequences in the reference metaphase 
chromosomes by prehybridization with appropriate blocking 
nucleic acids, emd/or blocking those repetitive sequences in 
the labeled DNA by prehybridization with appropriate blocking 
nucleic acid sequences, and/ or including such blocking nucleic 

20 acid sequences for said repetitive sequences dtiring said 
hybridization; 

e) rendering the bound, differentially labeled DNA 
sequences visualizable, if necessary; 

f) observing and/or measuring the intensities of the 
25 signals from each svibject DNA, and the relative intensities, 

as a fxinction of position along the reference metaphase 
chromosomeis ; and 

g) comparing the relative intensities eunong 
different locations along the reference metaphase chromosomes 

30 wherein the greater the intensity of the signal at a location 
due to one subject DNA relative to the intensity of the signal 
due to the other subject DNA at that location, the greater the 
copy nimber of the sequence that binds at that location in the 
first subject cell or cell population relative to the copy 

35 number of the substantially identical sequence in the second 
subject cell or cell population that binds at that location. 

Furth r disclosed are methods of quantitatively 
comparing copy numb rs of different DNA s qu nc Sf in one 
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sxobject cell or cell population relative t copy numbers of 
substantially identical sequences in another subject cell or 
cell population. A representative method is that comprising 
steps (a) through (e) of the method immediately detailed above 
5 and the following steps of: 

f . measuring the intensities of the signals from 
each of the bound subject DNAs and calculating the ratio of 
the intensities as a ftmction of position along the reference 
metaphase chromosomes to form a ratio profile; and 
10 g. quantitatively comparing the ratio profile among 

• different locations along the reference metaphase chromosomes^ 
said ratio profile at each location being proportional to the 
ratio of the copy number of the DKA sequence that bind to that 
location in the first subject cell or cell population to the 
15 copy number of siibstantially identical sequences in the second 
cell or cell population. 

Said representative methods can further comprise 
comparing copy numbers of different DNA sequences in more thaui 
two subject DNAs wherein the comparing is done pairwise 
20 between the signals from each subject DNA. 

This invention further discloses methods to 
determine the ratio of copy numbers of different DNA sequences 
in one subject cell or cell population to copy numbers of 
substantially identical sequences in another cell or cell 
25 population wherein the steps of (a) through (f ) as described 
above are performed as well as the following steps: 

g* determining the average copy number of a 
calibration sequence in both subject cells or cell 
populations, said calibration sequence being substantially 
30 identical to a single copy sequence in the reference metaphase 
cells; and 

h. normalizing the ratio profile calculated in (f ) 
so that at the calibration position, the ratio profile is 
equal to the ratio of the average copy numbers determined in 
35 (g) / the normalized ratio profile at any other location along 
the reference metaphase chromosomes thereby giving the ratio 
of the copy numbers of the DMA sequenc s in the two subject 
DNAs that bind at that location. That method can be extend d 
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to further subject nucleic acids as £ r exeuaple determining 
the ratio of copy nimbers of DNA sequences in more than two 
subject DNAs wherein the comparing is done pairwise between 
signals from each subject DNA. 
3 Further disclosed are methods for comparing copy 

numbers of different DNA sequences in a test cell or cell 
population, said method comprising applying steps (a) through 
(e) of the above-described methods and 

f • observing and/or measuring the intensities of 

10 the signal from each subject DNA, and the relative 

intensities, as a function of position along the reference 
metaphase chromosomes wherein one of the subject cells or cell 
populations is the test cell or cell population and the other 
is a normal cell or cell population; and 

15 (g) comparing the relative intensities among 

different locations along the reference metaphase chromosomes, 
wherein the greater the relative intensity at a location, the 
greater the copy number of the sequence in the test cell or 
cell population that binds to that location, except for sex 

20 chromosomes where the comparison needs to take into accotint 
the differences in copy numbers of sequences in the sex 
chromosomes in relation to those on the autosomes in the 
normal subject cell or cell population. 

A related representative method is that for 

23 comparing the copy niimber of different DNA sequences in a test 
cell or cell population comprising applying steps (a) through 
(e) of the abovB described methods wherein one of the subject 
cells or cell populations is the test cell or cell population, 
and the other is a standard cell or cell population wherein 

30 the copy numbers of the DNA sequences that bind to different 
positions on the reference metaphase chromosomes is known and 
steps: 

f . measuring the intensities of the signals from 
each of the bound subject DNAs and calculating the ratio of 

33 intensities as a function of position along the reference 
metaphase chromosomes to form a ratio profile; 

g. adjusting the ratio pr file at each 1 cation 
along the reference metaphase chromosomes by multiplying the 
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ratio profile by the known c py number of DNA sequ noes in the 
standard cell or cell population that bind there; and 

h. comparing the adjusted ratio profiles at 
different locations along the reference metaphase chromosomes 
5 wherein the greater the adjusted ratio profile at a location, 
the greater the copy number of the DNA sequence in the test 
cell or cell population that binds there. 

Another related representative method is that for 
determining the ratios of the copy numbers of different DNA 

10 sequences in a test cell or cell population, said method 

comprising applying steps (a) through (f ) of the immediately 
above-described method and the steps of adjusting the ratio 
profile at each location along the reference metaphase 
chromosomes by multiplying the ratio profile by the known copy 

15 number of sequences that bind there; and calculating the ratio 
of the copy number of a DNA sequence in the test cell or cell 
population that binds to one location on the reference 
metaphase chromosomes to the copy number of a sequence that 
binds to another location by dividing the adjusted ratio 

20 profile at the location of the first sequence by that at the 
location of the second. Said representative method can be 
extended to determine the copy nxmber of different DNA 
sequences in a test cell or cell population wherein steps (a) 
through (f ) 2ls described above are followed and then the 

25 following steps of adjusting the ratio profile at each 
location along the reference metaphase chromosomes by 
multiplying the ratio profile by the known copy number of DNA 
sequences in the standeurd cell or cell population that bind 
there; 

30 determining the copy number of a calibration 

sequence in the test cell or cell population that is 
substamtially identical to a single copy sequence in the 
reference cells; and 

normalizing the adjusted ratio profile so that at 

35 the location of the calibration sequence on the reference 
metaphase chromosomes, the normaliz d, adjusted ratio profile 
is equal to th copy number of the calibration s qu nee 
determined in the above step, the valu of the normalized, 
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adjusted ratio profile at another location then being equal to 
the copy nximber of the DNA sequence in the test cell or cell 
population that binds at that location* That method can be 
analogously performed wherein two or more calibration 
3 sequences are used, and the adjusted ratio profile is 
normalized to get the best fit to the copy numbers of the 
ensemble of calibration sequences. Preferably, the copy 
number of the calibration sequence is determined by in situ 
hybridization. Those methods can comprise in situ hybridizing 

10 probes for more than one calibration position and normalizing 
to obtain the best fit of the ratio profile to the calibration 
positions. The standard cell or cell population preferably 
have normal genomes. In many applications of CGH, the 
reference metaphase chromosomes are nosnnal. 

13 Further, this invention concerns the use of antenna 

cell lines. An exemplary method is for detecting 
amplification of a certain sequence or group of sequences in a 
subject cell or cell population, comprising essentially steps 
(a) through (e) of the above-described methods wherein the in 

20 situ hybridization is targeted to antenna cells in which the 
DNA sequence (s) to be tested for is or are amplified, and 
examining the reference cell for regions that sure hybridized 
significantly more intensely than others, the presence of such 
regions indicating amplifications of the sequence (s) which are 

23 being tested. The chromosomes of said antenna cell lines may 
be in interphase or in metaphase. 

When a single labeled subject nucleic acid is being 
hybridized, or if multiple labeled subject nucleic acids are 
hybridized sequentially, it is important that the binding 

30 sites on the reference genome not be saturated prior to 

observing and/or measuring the signal intensity ( ies) • In the 
case of a single labeled subject nucleic acid, nonsaturation 
can be effected in a number of ways, for example, by stopping 
the hybridization, by providing insufficient subject nucleic 

35 acid, and/or by providing a sufficient amotmt of xinlabeled 
nucleic acid which is sufficiently complementary to the 
refer nee chromosom s to competitively prevent saturation of 
sites therein by the labeled subj ct nucleic acid. 
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When there are two or m re laODeled subject nucleic 
acids, those subject nucleic acids can be hybridized in situ 
to the reference genome sequentially or simultaneously. 
Simultaneous in situ hybridization is preferred in that 
5 saturation of the targeted binding sites in the reference 

genome will not interfere with the procedure. When sequential 
in pitu hybridization is used, it must be performed under 
conditions wherein the individual hybridizations are stopped 
well before the binding sites on the reference chromosomes are 
10 saturated. 

Objects of this invention eure to detect sequence 
copy number imbalances throughout an entire genome in one 
hybridization, to map gains and/or losses of sequences in a 
genome, and/ or to provide a copy number karyotype of a subject 
15 genome. 

Further, an object of this invention is to enable 
the detection of relative copy number differences that are 
common to a number of different cells and/or cell populations. 
For example, CX3H methods can be used wherein ONAs extracted 

20 from cells of many different tumors are combined and labeled; 
the hybridization of those combined labeled DMAs to normal 
condensed chromosomes, provides for the rapid identification 
of only those copy number changes that occurred in most of th 
tumors. Less frequently occurring variations would be 

25 averaged out. Thus, this invention further provides for a CGH 
method wherein two or more of the subject nucleic acids that 
were extracted from different cells and/ or from numbers of 
cells from different cell populations, are labeled the same, 
and hybridized to a reference spread under conditions wherein 

30 repetitive sequences are removed and/or suppressed and wherein 
sequence copy number differences that are common in said 
combined labeled nucleic acid sequences are determined. 

Another object of this invention is to provide the 
means of cytogenetically analysing archived chromosomal 

35 material, that is, fixed material from, for example, biopsied 
tissue specimens, preferably cataloged and keyed t medical 
records of patients from whom the specimens wer taken, and 
archaeological chromosomal material. Such chromosomal 
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material cannot, of course, be karyotyped according to 
traditional means in that no live cells are present to culture 
and from which to prepare chromosomal spreads. However, the 
nucleic acid can be extracted therefrom and amplified by a 
5 polymerase chain reaction (PGR) procedure or by a non-PCR 
procedure and tested by the methods of this invention. 

This invention further provides for a method to 
detect simultaneously an ensemble of eunplif ications and/or 
deletions in a tumor wherein the results can be used to 

10 determine the subsequent behavior of that tumor. Said 
determination is made by associating the patteims of 
amplifications and/or deletions in tumor cells with the. 
behavior of that tumor. Such associations cam be made by 
testing, for example, as indicated immediately above, DNA from 

15 archived tumor tissue keyed to medical records, or when fresh 
tximor specimens axB tested by C6H and the patients are 
followed. Further, such associations can be made with CGH 
methods wherein there are more than one subject cell and/or 
cell population, for example, one or more tumors. 

20 Another object of this invention is to provide a 

method of analyzing cells from a suspected lesion at an early 
stage of development. An advantage of the methods of this 
invention is that only a few cells eire necessary for the 
analysis. The early detection of zunplif ications and/or 

25 deletions in cells from a lesion allow for early therapeutic 
intervention that can be tailored to the extent of, for 
example, invasiveness known to be associated with such genetic 
rearrangements. Further, such early detection provides a 
means to associate the progression of the cells with the 

30 genetic rearrangements therein detected by the methods of this 
invention. 

Tumors can be karyotypically heterogeneous 
containing therein various populations of cells each having 
different types of genetic rearremgements. As indicated above 
35 ttamor cells are difficult to culture, and it is not cleeur that 
cultured cells are representative of th original ttmor cell 
p pulati n. This inv ntion provides the means t by-pass the 
culturing obstacle and allows genetic characterization of 
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txmor cells and thus, of the heterogeneity of turn rs by 
testing cells from different sxibregions thereof according to 
the methods of this invention. Bulk extraction of the nucleic 
acid from many cells of a tumor cem also be used to test for 
5 consistent amplifications and/or deletions within a tumor. 

It is cuiother object of this invention to provide 
methods of detecting amplifications and/or deletions of 
nucleic acid sequences wherein certain cell lines termed 
herein "antenna cell lines", axa used to enhance the 

10 sensitivity of the detection. 

It is still further an object of this invention to 
provide methods of prenatal or perinatal emalysis wherein the 
nucleic acid of the child's cells is extracted and tested 
according to the methods of this invention. In one embodiment 

15 of CGH, such material is human and hybridized to a normal 

htman metaphase spread to detect whether any deletions and/ or 
amplifications are therein present, for example, an extra copy 
of chromosome 21, diagnostic for Down syndrome. Test kits for 
performing C6H methods are also provided. 

20 

BRIEF DES CRIPTIQK OF THE FTGURES 

f j,q\Ure X schematically illustrates the general 
approach used in performing the methods of this invention — 
comparative Genomic Hybridization (CGH) . The reference 

25 chromosome spread is hybridized with various nucleic acid 
mixtures, either simultaneously or at different times, to 
obtain the desired information. Representative mixtures could 
include unlaUseled sequences designed to block sequences in the 
various other nucleic acid pools, for exaa^le^ the high-copy 

30 repetitive sec[uences in human genomic DMA; unlabeled 

competitor nucleic acid to prevent saturation of the target 
sites for the labeled mixtures, for example, human genomic DNA 
within a factor of 10 of the concentration used for the 
labeled subject nucleic acids (see Figure 4); and one or more 

35 pools of sequences of different origin that are differently 
labeled so that their binding can be independently assessed, 
for example^ tximor and normal genomic DNA (see Figures 5 and 
6) . The inf omaation on the sequence frequency of the labeled 
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p ols is obtained by analysis of the intensity of the 
individual signals and/or the differences in ratios of 
intensities among the signals as a fxinction of position along 
the reference chromosomes. 
^ Ficmre 2 outlines general aspects of the CGH 

procedure used in Example l, infra . The reference chromosome 
spread, in this exeunple normal human chromosomes, is first 
hybridized for about one hour with a high concentration of 
unlabeled human genomic DNA (Figiure 2A) . That 

10 prehybridization blocks many of the high copy repetitive 

sequences in the chromosomes so that the high copy repetitive 
sequences in the labeled subject nucleic acid, in this case 
labeled tumor DNA, will not substantially contribute to the 
signal during the subsequent hybridization. The labeled tumor 

15 DNA, and perhaps some competitor DNA or other comparison 
nucleic acid are then hybridized to the target reference 
spread (Figure 2B) . Cot-1 DNA can be included in the 
hybridization as in Example l, below to block more effectively 
the centromeric repetitive sequences in the labeled subject 

20 nucleic acids. 

Figure 2 is representative of one way of reducing 
signals from repetitive sequences. Other methods are detailed 
herein Infra. In each of the CGH methods including the 
procedures outlined in the rest of the figures, some means of 

25 reducing the signal from the repetitive sequences is used, but 
not specifically indicated in the figures. It is important 
for CGH that the signal from each subject nucleic acid be 
dominated by sequences that bind to well-defined loci. Total 
suppression of the signal from the genomic repeats is not 

30 necessary, but the poorer the suppression, the less able the 
procedure is to detect small differences in sequence 
frequency. 

Fiqur? 3 further illustrates the procedure used in 
Example 1. As shown in Figure 3A, labeled human tumor DNA is 
35 hybridized to a normal human chromosome spread. [As indicated 
in the description for Figure 2, provisions were made to 
suppress the signal fr m the repetitive sequences alth ugh the 
provisions are not sp cifically indicated in the figure. 
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Example l details a preferred method to suppress the 
hybridization signals from repetitive sequences.] In this 
representative example, the tumor DNA is assumed to contain a 
region wherein some sequences are highly amplified, for 
5 e:»:ample, an amplicon containing an oncogene. The amplified 
sequences in the tumor DNA may be clustered and integrated in 
some tumor chromosomes; they may be integrated into multiple 
places in the ttmor genome; or, they may exist as extra- 
chromosomal elements. The sequences of the amplicon will map 

10 to some chromosomal location in the reference genome, which in 
this case is a normal human genome. 

Figure 3B illustrates the kinetics of the build-up 
of the signal on a target reference chromosome. The signal 
builds more rapidly in the amplified region since more copies 

15 of those sequences are available for hybridization. If the 
reaction is stopped before the target chromosome is saturated, 
or if insufficient labeled DNA is added to achieve saturation, 
then the genomic region that was amplified in the tumor will 
appecur higher in intensity on the normal chromosome as 

20 illustrated by the band with the denser shading on the left 

reference chromosome. The more intensely labeled region (band 
with the denser shading) indicates the location and extent of 
the amplicon as reflected in the reference genome. Thus, the 
amplification is detected without prior knowledge of its 

25 existence, and the origin of the amplified sequences is mapped 
in the normal human genome. 

If the reaction illustrated in Figxire 3 is allowed 
to proceed to saturation of the target sites, contrast is 
lost, as shown by the representative reference chromosome on 

30 the right wherein the amplicon cannot be distinguished. Thus, 
in this embodiment of CGH, it is important to stop the 
hybridization before saturation of the taurget or provide 
insufficient probe for saturation. The graphs schematically 
show the build-up of the hybridization signal in the region 

35 that was amplif i d (graph on right) emd in the remainder that 
was unamplif ied (graph on left) . The arrows connect the 
chromosomal regions with the tim s of observation on the 
kinetic curve. 
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Figure 4 illustrates an embodiment of CGH that 
avoids the potential saturation of the taurget as shown in the 
right portion of Figtxre 3B. In this representative example, 
the reference nucleic acid is a human chromosome spread; the 
5 subject nujleic acid is labeled tumor DNA (4A) . If iinlabeled 
human genomic DNA is included with the labeled tumor DNA in 
excess, in this case at a five*fold higher concentration them 
that of the lad3eled tumor DNA, then any satiuration of the 
target will be due to a combination of labeled and unlabeled 

10 copies of the nucleic acid sequences, rather than just labeled 
copies as shown in the right portion of Figure 3B. [Once 
again, as indicated in Figures 2 and 3 the means of reducing 
the signal from repetitive sequences is not indicated in this 
figure, but it is assumed that a protocol is performed to. 

15 remove substantially the repetitive sequences that would bind 
to multiple loci in the reference genome and/or to block such 
sequences from binding to the target.] 

At the early stages of the reaction, the aunplified 
region will build up faster than elsewhere in the chromosome 

20 (for example if the sequence is as^lif led five-fold, it would 
build up 5 times as fast) and will be detectable as in the 
left portion of Figure 3B. However as the reaction proqeeds 
to saturation, the unamplified regions of the chromosome reach 
only one-fifth (1/5) of the intensity shown in the right 

25 portion of Figure 33, because most of the sites are filled by 
unlabeled copies of the sequences. On the other hand, a 
sequence that was amplified five-fold in the tumor would reach 
one-half (1/2) of the saturation intensity since an equal 
number of leUdeled and unlabeled copies of those sequences sure 

30 present. Thus, contrast is maintained according to this 

embodiment at all stages of the reaction (as shown in Figure 
4B) , although it changes as the reaction proceeds. 

Figure 5 illustrates an embodiment of CGH designed 
to enhance its sensitivity in detecting small changes in copy 

35 number of various sequences. When a CGH procedure as 

indicated in Figure 4 is followed, intrinsic variation in th 
saturation levels, or rate of signal build-up at different 
positions in th ref erenc genome may n t be indicative of 
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abnormal gain or loss of sequences. Such intrinsic variations 
would interfere with interpretation of intensity differences 
as indicating differences in copy number of the sequences. 
This CGH embodiment overcomes that potential problem by 
5 providing a mixture of labeled stxbject nucleic acid, in this 
case tumor DNA labeled with a green f luorochrome, and a 
differently labeled competitor nucleic acid in this case 
normal human genomic DNA labeled with a red f luorochrome. The 
two differently labeled DNAs are simultaneously hybridized to 

10 the chromosome spread. [Once again, removal of the repetitive 
• sequences and/or blocking of the signal therefrom is performed 
but not illustrated. ] Changes in the ratio of green to red 
along each of the chromosomes in the reference spread then 
indicate regions of increased or decreased sequence copy 

15 nx2mber in the tumor. Those ratio chemges may result in color 
variations from red to yellow to green on the reference 
spread. 

figyre 6 graphically and schematically explains the 
kinetics underlying the CGH embodiment illustrated in Figure 

20 5. Iii the center is one of the chromosomes of the reference 
chromosome spread, a normal humsm chromosome in this case. 
The darkness of the shading on the reference chromosome shows 
the ratio of green to red intensity along the chromosome. 

In the aa^lif ied region, the green/red ratio is much 

25 higher than in the normal region, whereas in the deleted 

region the green/red ratio is less than in the normal region. 
The arrows from examples of each of the different green/red 
intensity regions point to kinetic curves that indicate the 
build-up of green (solid line for the tumor DNA) and red 

30 (dashed line for the normal DNA) signals during the 

hybridization. In the normal region, upper left graph, the 
red and green signals build together. (They have been 
normalized to be equal for the purposes of this explanation.) 
In the amplified region, upper right, the green (tumor) signal 

35 builds up much more rapidly than the r d (n rmal) signal, the 
green/ red ratio being approximately the level of amplification 
(given the normalization to th normal part of the 
chromosome) . 
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In the lower left of Figure 6, the signal build-up 
for the duplicated region is shown; the green (tumor) signal 
is 50% brighter than the red (normal) signal. In the lower 
right, the build-up for a deleted region is schematically 
5 described; the green (tumor) ^:iignal is 50% dimmer than the red 
(normal) signal. The ratio approach of this CGH embodiment 
further normalizes for the frequent finding that hybridization 
to some chromosomes in a spread is intrinsically brighter than 
that for others because of differences in the local 
10 hybridization environment. 

riqWTfi 7 graphically illustrates the correlation of 
the nximber of X chromosomes in five fibroblast cell lines and 
the average green-to-red ratio of the X chromosome (s) relativ 
to the same ratio for the autosomes. 
15 Fi^qure 9 illustrates green-to-red fluorescence ratio 

profiles of chromosomes 1, 9, ii, 16 and 17 after comparative 
genomic hybridization with breast cancer cell line 600PE 
(green) and with a normal DNA (red) . The profiles reflect th 
relative copy number of the chromosomal regions. Fluorescence 
20 in s4tU hybridization (FISH) with 16p and 16q cosmid probes to 
interphase and metaphase 600PE cells indicated that there were 
two signals with 16p cosmid probes and one signal from the 16q 
cosmid probes. That information on the absolute copy number 
of those loci provided by FISH permits interpretation of the 
25 ratio 1.0 as indicating that there are two copies of the 
sequence throughout the genome. 

The dip in the profile at lp34 through lp36 may 
represent a previously unsuspected small interstitial 
deletion; however, that observation has not yet been 
30 independently verified with specific probes for that region. 

Centromeric and heterochromatic regions of the 
genome are not included in the analysis because the Cot-1 DNA 
partially blocks signals in those regions, and the large copy 
number polymorphisms between individual sequences at those 
35 loci effect unreliable ratio data. 

Figure 9A and 9B respectively provide gr n-to-red 
fluorescence ratio profiles of chromes me 8 (Figure 9A) and 
chromes me 2 (Figure 9B) after comparative genomic 
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hybridization respectively with COLO 320 HSR (hximan colon 
adenocarcinoma cell line) and NCI H69 (small cell liing 
carcinoma cell line) cell line DNAs (green) and with normal 
human DNA (red) . 

5 In Figure 9A, the myc locus at 8q24 shows a highly 

elevated green-to-red ratio,, which is consistent with the 
known high level amplification o£ myc in the COLO 320HSR cell 
line. 

In Figure 9Br three regions of as^lif ication are 
10 seen on chromosome 2. The signal at 2p24 corresponds to the 
location of N- myc known to be amplified in the NCI-H69 cell 
line. The two other regions with a highly increased green-to- 
red fluorescence ratio, at 2p21 and 2g21, were not previously 
known to be amplified in the KCI-H69 cell line. 

15 

"^^TTf^ nESCRIPTION 
Comparative Genomic Hybridization (C6H) has also 
been termed Copy Ratio Reverse Cytogenetics (CRRC) , 
competition hybridization and quantitative in situ ratio 

20 karyotyping (QUIRK) • Further, in the embodiment wherein 
fluorochromes are used as labels, it has been termed 
competition FISH (fluorescence in situ hybridization). CGH 
specifically provides methods whereby amplifications, 
duplications and/or deletions can be identified in an 

25 immediate overview of a genome. 

CGH provides methods for determining veuriations in 
the copy number of different elements in a mixture of nucleic 
acid sequences (for example, genomic DNA isolated from a 
tumor) as a function of the location of those sequences in the 

30 genome of a reference organism (for example, the genome of a 
normal cell from the same species) • The methods con^rise the 
use of in situ hybridization of the nucleic acid sequence 
mixture to a chromosome spread of the reference orgemism, and 
meastiring the intensity of the hybridization at different 

35 locations along th teurget chromosomes. Exemplsury methods sure 
schematically outlined in Figures 1-6. Those illustrative 
exeunples are not exhaustive but suggest the wide remge of 
variations and other uses of the basic approach. 
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As the figure d scriptions Indicate, it is critical 
that signals from repetitive sequences do not dominate the 
signal from the subject nucleic acid pool, and that they be 
removed from the pool or that their signals be suppressed as 
5 necessary. It is preferred to exclude sequences from the 
hybridization or block sequences in the hybridization mixture 
that could bind to multiple clearly separated positions on the 
chromosomes, for example, sites that are on different 
chromosomes, or that are on the same chromosome but are well- 
10 separated, in many applications of CGH, it is the high copy 
repetitive sequences, such as Alu, Kpn, Lines, and alpha- 
satellites among others, that are removed from the labeled 
subject nucleic acid and/or which are blocked and/or the 
binding sites therefor are blocked. Described herein are 
15 methods to remove and/or block those repetitive signals. It 
should be noted that nucleic acid sequences in the labeled 
nucleic acid that bind to single copy loci are substantially 
retained in the hybridization mixture of labeled subject 
nucleic acids, and such single copy sequences as well as their 
20 binding sites in the reference chromosome spread remain 

substantially unblocked relative to the repetitive sequences 
that bind to multiple loci (that is, loci that are visually 
distinguishable) both before and during the hybridization. 

The methods of this invention provide the means to 
25 identify previously unknown regions of amplification and 

deletion. For example, one embodiment of CGH as detailed in 
Example l herein provides an efficient method that gives an 
immediate overview of a genome identifying all regions that 
are amplified greater than about five-fold to ten-fold as well 
30 as at least large deletions. More sensitive embodiments that 
can identify smaller amplifications and deletions are also 
disclosed. 

Nanogram quantities of the stibject nucleic acids are 
required for the CGH methods of this invention. Paraffin 
35 embedded tumor sections can be used as well as fresh or f r zen 
material. Snap frozen material from normal and malignant 
tissue are preferred for mRNA isolation. 
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Standard procedures can be used t isolate the 
required nucleic acid from the subject cells. However, if the 
nucleic acid, for example, DNA or mRNA, is to be extracted ' 
from a low number of cells (as from a particular tumor 
5 subregion) or from a single cell, it is necesary to amplify 
that nucleic acid, by a polymerase chain reaction (PCR) 
procedure or by a non-polymerase chain reaction (non-PCR) 
procedxire. PCR and preferred PCR procedures are described 
infra. Exemplary non-PCR procedures include the ligase chain 

10 reaction (LCR) and linear an^lification by use of appropriate 
primers and their extension (random priming). 

Some of the various embodiments of CGH are 
illustrated, particularly in Figures 1-6. In the embodiment 
illustrated in Figures 5 and 6, wherein a subject nucleic 

15 acid, in this case, human genomic DNA, that is leOieled 

differently from another subject nucleic acid, aa^lifications 
and/or deletions are indicated by a change in ratio between 
the different signals, rather than just a change in signal 
intensity. 

20 The representative exan^les conceiming CGH of 

Examples 1, 2 and 3 below involve the hybridizations of tumor 
cell line DNA to normal human metaphase spreads. However, 
there are many permutations and combinations of pairwise and 
multiple hybridizations of different nucleic acids from 

25 different genomes all of which are considered to be within the 
scope of this invention. 

For exaii?>le, CGH could be used to hybridize labeled 
DNA from a tumor cell line to metaphase spreads of that same 
cell line to estimate the level and pattern of amplification 

30 in each cell line, comparing those results to hybridizations 
of said tumor cell line DNA to a normal human metaphase 
spread. Alternatively, labeled tumor cell line DNA and 
differently labeled human genomic DNA could be simultaneously 
hybridized to a metaphase spread of a tumor cell line 

35 metaphase spread. Further, DNA from a primary tumor and that 
from its metastasis could be differently labeled and 
hybridiz d in a CGH method to a normal human metaphase or to a 
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related tumor cell line m taphase. Those are just some of the 
many examples of CGH. 

Although the exeunples herein concern the 
hybridizations of the DNA from breast cancer cell lines and 
5 primary tumors to normal human metaphase spreads, it will be 
clear to anyone skilled in the art that CGH is not limited to 
studying genomes of cancer cells or to the results of 
hybridizing abnormal genomes to normal genomes. CGH permits 
the comparison of nucleic acid sequence copy frequencies of 

10 any two or more genomes, even genomes of different species if 
their nucleic acid sequences are sufficiently complementary to 
allow for meaningful interpretation. It should be noted 
regarding interspecies comparisons that the information 
obtained by CGH includes not only an assessment of relative 

15 copy number but also that of sequence divergence. 

It will also be clear to those skilled in the art 
that hybridization with nucleic acid other than chromosomal 
DNA, such as messenger RNA (mRNA) or complementary DNA (cDNA) 
of subject cells can be used to determine the location and 

20 level of expression of genes in those ceils. Conventional 
methodology is used to extract mRNA from a cell or cell 
population, and to synthesize in vitro cDNA by reverse 
transcription. 

CGH does not require the preparation of condensed 

25 chromosomes, for example, metaphase, prophase or other 

condensed chromosomal states, of the subject genomes. Thus, 
genomes irom which metaphase, prophase or otherwise condensed 
chromosomal spreads are difficult, time-consuming or not 
possible to prepare at least in good quality, for example, 

30 genomes of tumor cells or fetal cells can be studied by CGH. 

In CGH, labeled subject nucleic acids, for exaa^le, 
labeled tumor DNA, is hybridized to a reference genome, for 
example, a normal human metaphase spread, under conditions in 
which the signal from amplified, duplicated and/or deleted 

35 nucleic acid sequences from the labeled nucleic acid can be 
visualized with go d contrast. Such visualizati n is 
accomplish d by suppressing the hybridization of repetitive 
sequences that bind to multiple loci including th high copy 
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interspersed and clustered repetitive sequences^ such as, Alu, 
Kpn, Lines, alpha-satellites among others, using xinlabeled 
total human genomic nucleic acid, preferably DNA, and/or the 
repeat-enriched (Cot-l) fraction of genomic DNA, and/or by 
5 removing such repetitive sequences from the hybridization 

mixture, in providing the detection sensitivity required, the 
extent of suppression of the hybridization of repetitive 
sequences and/ or removal thereof can be adjusted to the extent 
necessary to provide adequate contrast to detect the 
10 differences in copy number being sought; for example, subtler 
copy number changes may require the suppression or removal of 
lower level repetitive sequences. 

When combining more than one labeled nucleic acid in 
a hybridization mixture, the relative concentrations and/or 
15 labeling densities may be adjusted for various purposes* For 
example, when using visual observation or photography of the 
results, the individual color intensities need to be adjusted 
for optimum observability of changes in their relative 
intensities. Adjustments can also be made by selecting 
20 appropriate detection reagents (avidin, antibodies and the 

like) , or by the design of the microscope filters among other 
parameters. When using quMtitative image analysis, 
mathematical normalization can be used to condensate for 
general differences in the staining intensities of different 
25 colors. 

The kinetics of the CGH hybridizations are 
complicated. Since the subject nucleic acids are frequently 
double stranded, complementeury sequences will reassociate in 
the hybridization mix as well as hybridizing to the target. 
30 Such reassociation may result in a more rapid decrease in 
concentration of the high copy sequences than the low copy 
ones, thereby making the signal intensity variations on the 

^ence chromosomes less pronounced than the copy 
differences in the original subject DNAs. In addition, non- 
35 specific binding of the labeled subject DNAs to the slide, 

coverslip, tc. may generally reduce the c ncentrati n of that 
labeled subject nucleic acid during the hybridization. Those 
sltilled in the art will recogniz numerous methods of 
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optimizing the quantitative aspects of CGH, such as, 
mathematical correction of digital images, supplying freshly 
denatured subject DNA during the hybridization, and adding 
unlabeled genomic DNA in excess to dominate the reassociation 
5 rates. The term "saturation" is defined in the context of 
hybridization kinetics. 

The resolution of CGH is presently at a level that 
can be seen through a light microscope, as is traditional 
cytogenetic staining. Thus, if a small sequence in a sxibject 

10 nucleic acid is amplified, to be seen as a signal in a subject 
genome, it must be amplified enough times for its signal to be 
able to be visualized under a light microscope. For example, 
the locus for erbB-2 which is relatively small (very 
approximately, a few hundred kb) , needs to be amplified at 

15 least greater than five times to be visually distinguishable 
under a light microscope when the CGH embodiment used in 
Example 1 is employed. On the other hand, if a large section 
of a chromosome is present at increased frequency in a subject 
nucleic acid, the signal from that region would show up in the 

20 reference genome at a much lower level of amplification. 

The term "labeled" is herein used to indicate that 
there is some method to visualize nucleic acid fragments that 
are boxind to the target, whether or not the fragments directly 
carry some modified constituent. A section infra entitled 

25 "Labeling the Nucleic Acid Fragments of the Subject Nucleic 
Acids" describes varioxis means of directly labeling the probe 
and other leUt>eling means by which the bound probe can be 
detected. 

The phrase "antenna cell line" is herein used to 
30 indicate a reference genome that has one or more known 
significant genetic aberrations, for example, a cell line 
known to have an oncogene that is highly amplified, for 
exan5)le, in large homogeneously staining regions (HSRs) . The 
amplified regions of that cell line would thus provide a much 
35 bigger target site than a n rmal chr mos me spr ad. Thus, 

obs rvation of the signal from such a larg targ t site would 
be easier in that on average the signal would b brighter from 
amplified target sequences in the r ference genome as pr vided 
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by such an antenna cell line. A subject nucleic acid 
extracted from, for example, a number of tumor cells, could be 
tested by a CGH hybridization to such an antenna cell line to 
see if it also contained amplif ication(s) of the oncogene 
5 known to be amplified in the cell line. 

When an antenna cell line is used as the reference 
genome, there are instances wherein it can be used in 
interphase rather than as a chromosome spread. For example, 
if one is checking to see if a certain oncogene is amplified 
10 or not in the subject nucleic acid,, interphase CGH is 

sufficient. However, the maximum amount of information is 
provided when condensed chromosome spreads are used. 

A base sequence at any point in the genome can be 
classified as either "single-copy" or "repetitive". For 
15 practical ptirposes the sequence needs to be long enough so 
that a complementary probe sequence can form a stable hybrid 
with the t£irget sequence under the hybridization conditions 
being used. Such a length is typically in the range of 
several tens to hundreds of nucleotides. 
20 A "single-copy sequence" is that wherein only one 

copy of the target nucleic acid sequence is present in the 
haploid genome. "Single-copy sequences" are also known in the 
art as "unique sequences". A probe complementary to a single- 
copy sequence has one binding site in haploid genome. A 
25 "repetitive sequence" is that vrtierein there is more than one 
copy of the same target nucleic acid sequence in the genome. 
Each copy of a repetitive sequence need not be identical to 
all the others. The important feature is that the sequence be 
sufficiently similar to the other members of the fauaily of 
30 repetitive sec[uences such that under the hybridization 

conditions being xised, the same fragment of probe nucleic acid 
is capaOjle of forming stable hybrids with each copy. 

Herein, the terms repetitive sequences, repeated 
sequences and repeats are used interchsmgeably. 
^5 "Eh phrase "metaphase chromosomes" in herein defined 

to encompass the concept of "c ndensed chromos mes" and is 
d fined to mean not only chromosomes cond nsed in the prophas 
or metaphase stage of mitosis but any condensed chromosom s, 
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for example, those condensed by premature chromosome 
condensation or at any stage in the cell cycle wherein the 
chromosome can be visualized as an individual entity. It is 
preferred that the chromosomes in the reference genome be as 
long as possible but condensed sufficiently to be visualized 
individually. 

A subject nucleic acid is herein considered to be 
the same as amother nucleic acid if it is from a member of the 
same sex of the same species and has no significant 
cytogenetic differences from the other nucleic acid. For 
example, the DNA extracted from normal lymphocytes of a human 
female is considered for the purposes of this invention to be- 
the same nucleic acid as that of DNA from normal cells of a 
human female placenta. 

The following abbreviations are used herein: 
Afc)^y?Y3T»ta,on? 

AAF - N-acetoxy-N-2-acetyl-aminofluorene 
ATCC - American Type Culture Collection 
BN - bicarbonate buffer with NP-40 



Brd/ 
Urd 

BRL 

bp 

CCD 

C6H 

Chr. 

CHL 

CRRC 

DAPI 

dATP 

DCS 



bromodeoxyuridine 

Bethesda Reseaurch Laboratories 
base pair 

charge coupled device 
Compeurative Genomic Hybridization 
chromosomal 

chronic myelogenous leukemia 

Copy Ratio Reverse Cytogenetics 

4 , 6-diamidino-2-phenylindole 

deoxy adenosine triphosphate 

as in fluorescein-avidin DCS (a commercially 
available cell sorter grade of fluorescein 
Avidin D) 



dCTP - deoxycytosine triphosphate 
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dGTP - deoxyguan sine triphosphate 

DI - DNA index 

DM - double minute chromosome 

dNTP - deoxynucleotide triphosphate 

dTTP - deoxythymidine triphosphate 

dUTP - deoxyuridine triphosphate 

EDTA - ethylenediaminetetraacetate 

E/P - estrogen/progesterone 

FISH - fluorescence in situ hybridization 

FACS - fluorescence-activated cell sorting 

FITC - fluorescein isothiocyemate 

HFLC - high performance liquid chromatography 

HSH - homogeneously staining region 

ISCN - International System for Cytogenetic 
Nomenclature 

IB - isolation buffer 

kb - kilobase 

kDa - kilodalton 

LCR - ligase chain reaction 

LOH - loss of heterozygosity 

Mb - megaQ>ase 

met. metastasis 

min - minute 

ml - milliliter 

mM - milliMole 

mm - millimeter 

ng - nanogram 

NIGMS - National Institut of General Medical Sciences 

NP-4D - non-ionic detergent commercially availaUale from 
Sigma as Nonidet P-40 (St. Louis, HO) 
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PBS - phosphate-buffered saline 

PGR - polymerase chain reaction 

PHA - phytohemagglutinin 

PI * propidixm iodide 

5 PI. pleiiral 

PMSF - phenylmethylsulfonyl fluoride 

PN - mixture of 0«1 M NaH2P04 and 0.1 M 

buffer NajHPO^, pH 8; 0.1% NP-40 

PNM - Pn buffer plus 5% nonfat dry milk 

buffer (centrifuged) ; 0.02% Na aside 

10 QUIPS - quantitative image processing system 

QUIRK - quamtitative in situ ratio karyotyping 

Rb-l - retinoblastoma tumor suppressor gene 

RFLP - restriction fragment length polymorphism 

RPM - revolutions per minute 

15 SD - Stemdard Deviation 

SDS - sodium dodecyl sulfate 

SSC - 0.15 M NaCl/0.015 M Na citrate, pH 7 

Td *- doubling time 

ug - microgram 

20 ul - microliter 

urn - micrometer 

uM - micromole 

VNTR - varieQ)le ntimber tandem repeat 



Resolution of differences in copy number can be 
25 improved by the use of image analysis and by averaging the 
results from hybridizations of a subject nucleic acid to 
multiple c ndensed chr mosome spreads. Using such methods, 
the backgr und signal (noise) can be diff rentiated from 
actual nucl ic acid sequence copy number differences. 
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Image Analysis; 

An image analysis system, preferaibly computer- 
assisted, can be used to enhance and/or accurately qpiantitate 
the intensity differences between and/ or euaong the signals 
5 from a hybridization and the background staining differences 
for more accurate and easier interpretation of results. Image 
analysis and methods to measure intensity are described, for 
example, in Hiraoka et al.. Science. 238 ; 36-41 (1987) emd 
Aikens et al., Meth, Cell Biol>. 29 ; 291-313 (1989). In such 

10 an image analysis system, it is preferred to use a high 

quality CCD csunera whose intensity response is known to be 
lineaur over a wide range of intensities. 

The components of a particulcur quantitative image 
processing system (QUIPS) are described in Example 1 under the 

15 stibheading Fluorescence Microscopv a nd Interpretation of 
Results > As exemplified in Example 1, a computer-assisted 
image analysis system with a f iltejrwfaeel is used so that the 
images from the signals and counterstaining of the DNA are 
superimposed on one image. Pseudocolors, that is, colors that 

20 are not exactly spectrally converted, can be displayed. 
Contrast stretching, wherein the differences between the 
intensity levels of the signals and background staining . 
differences are enhanced by adjusting controls of the image 
emalysls system. Thresholding can also be used wherein the 

25 background staining can be assigned a value close to zero so 
it would barely appear in the processed image from such a 
system. Similarly, computer analysis permits substraction of 
background, smoothing of fluctuations in the signals, accturate 
intensity and ratio calculations and the ability to average 

30 signals on chromosomes in multiple spreads. 

Hybridization of the subject DKAs to the reference 
chromosomes gives information on relative copy nimbers of 
35 sequences. Some additional normalization is required to 
obtain absolute copy number information. One convenient 
method to do this is to hybridize a probe, for example a 
cosmid specific to some singl locus in the normal haploid 
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genome, to the interphase nuclei of the sxibject cell or c 11 
population (s) (or those of an equivalent cell or 
representative cells therefrom, respectively) . Coxinting the 
hybridization signals in a representative population of such 
5 nuclei gives the absolute sequence copy number at that 

location. Given that information at one locus, the intensity 
(ratio) information from the hybridization of the subject 
DNA(s) to the reference condensed chromosomes gives the 
absolute copy number over the rest of the genome. In 

10 practice, use of more than one reference locus may be 
desirable. In this case, the best fit of the intensity 
(ratio) data through the reference loci would give a morid 
accurate determination of absolute sequence copy number over 
the rest of the genome. 

15 Thus, the CGH methods of this invention combined 

with other well-known methods in the art can provide 
information on the 2d>solute copy niuabers of substantially all 
RNA or DNA secjuences in stibject cell(s) or cell population (s) 
as a function of the location of those sequences in a 

20 reference genome. For example, one or more chromosome- 
specific repeat sequence or high complexity painting probes 
can be hybridized independently to the interphase nuclei of 
cells representative of the genomic constitution of the 
siibject cell(s) or cell population (s) . Whole chromosome 

25 painting probes are now available for all the htunan 

chromosomes [Collins et al.. Genomics, li s 997-1006 (1991)]. 
Specific repeat-sequence probes are also available [Trask et 
al.. Hum. Genet., 78: 251 (1988) and references cited 
therein; €uid commercially avail2U3le from Oncor (Gaithersburg, 

30 MD, USA)]. Hybridization with one or more of such probes 

indicates the eJ^solute copy numbers of the sequences to which 
the probes bind. 

For such interphase analysis, painting probes with a 
complexity of from about 35 Icb to about 200 kb, are preferred; 

35 probes from about 35 kb t about 100 Icb are further preferred; 
and still more pref rred ar probes having a c mplexity of 
from about 35 Icb to 40 kb, for example, a cosmid pr be. 
Exemplary of such locus-specific painting pr bes are any 
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cosmid, yeast artificial chromosomes (YACs) , bacterial 
artificial chromosomes (BACs) , and/ or pi phage probes as 
appropriate, preferably to the arms of a selected chromosome « 
Such cosmid probes, for example, sure commercially availeddle 
5 from Clontech [South San Francisco, CA (USA)] which supplies 
cosmid libraries for all the hxamem chromosomes. Another 
example of a cosmid probe that could be used in such methods 
of this invention would be a 3p cosmid probe called CC13-787 
obtained from Yusuke Nakamura, M.D., Ph.D^ [Division of 

10 Biochemistry, Cancer Institute, Toshima, Tokyo, 170, Japan]. 
Its isolation and mapping to 3p21.2-p21.1 is described in 
Yamakawa et al., Genomics . £(3): 536*543 (1991). Another 
example would be a 3q cosmid probe named J14R1A12 obtained 
from Wen-^Lin Kuo [Biomedical Depeurtment^ P.O. Box 5507 (L- 

15 452] , Lawrence Livermore National Leiboratory Livermore, CA 
94550 (USA)]. For interphase analysis, preferred repeat 
sequence probes are centromeric-specific and/or peri- 
centromeric-specif ic repeat sequence probes. Such a 
centromeric-probe is, for example, the chromosome 17 peri- 

20 centromeric repeat probe (cosmid ckl7.lO) and the alpha 
satellite repeat probe for the centromeric region of 
chromosome 8, both of which eure described in Example 1 infra , 
A variety of repeat sequence probes aure commercially available 
from Oncor [Gaithersburg, MD (USA)]. However, the locus- 

25 specific painting probes are preferred over the repeat 

sequence probes for the methods of this invention to determine 
absolute copy numbers of nucleic acid sequences. 

Further, when the subject nucleic acid sequences sure 
DNA, the reference copy nimbers can be determined by Southern 

30 emalysis. When the sxibject nucleic acid sequences are RNA, 
the reference copy numbers can be determined by Northern 
analysis . 

Those reference copy numbers or reference 
frequencies provide a stemdeurd by which substemtially all the 
35 RNA or DNA sequences in the subj ct cell(s) or cell 

population (s) can be determin d. CGH methods are used to 

determine the relativ copy numbers of the rest of the 

s quences. However, absolute copy numJders require a standard 
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against which the results f CGH can be determined. Otherwise 
the CGH procedures would have to be highly standardized and 
quantitated to see differences in the absolute copy nuiabers of 
sequences in a genome, for example, haploidy, triploidy, 
5 octaploidy, wherein there are 1, 3 and 8 copies of ^^.ach of the 
chromosomes, respectively. 

pgR and Microdissection: 

The mechanics of PGR axB explained in Saiki et al., 

10 fieience, 230 ; 1350 (1985) and U.S. Patent Nos. 4,683,195, 
4,683,202 (both issued July 18, 1987) and 4,800,159 (issued 
January 24, 1989).] PGR offers a rapid, sensitive and 
versatile cell-free molecular cloning system in which only 
minute eunounts of starting material are required. 

15 A preferred PGR method to amplify the subject 

nucleic acids for testing by CGH is a PGR adapter- linker 
ao^lification [Saunders et al., Nuc. Acids Res., 17 9027 
(1990); Johnson, Genomics. 6 ; 243 (1990) and PCT 90/00434 
(published August 9, 1990)]. The labeled subject nucleic acid 

20 could be produced by such a adapter- linker PGR method from a 
few hundred cells; for example, wherein the subject nucleic^ 
acid is tumor DNA, the source DNA could be a few hundred tumor 
cells. Such a method could provide a means to analyse by GGH 
clonal sub-populations in a tumor. 

25 Another further preferred PGR method is a method 

employing a mixture of primers described in Meltzer et al . , 
"Rapid Generation of Region Specific Probes by Chromosome 
Microdissection and their Application: A Novel Approach to 
Identify cryptic Chromosomal Rearrangements," Wftt^ff? — 

30 SsnatiSfij-Kl)! 24-28 (April 1992). Microdissection of sites 
in the reference metaphase spread that produce signals of 
interest in GGH, would permit PGR amplification of nucleic 
acid sequences bound at such sites. The ai^lified nucleic 
acid could then be easily recovered and used to probe 

35 available libraries, as for example, c smid libraries, so that 
the amplified sequences could b m re rapidly identified. 

High copy r petitive s quences can be suppressed in 
amplifying the subj ct nucleic acid by PGR. The PGR primers 
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used for such a procedure are complementeuy t the ends of the 
repetitive sequences. Thus, upon proper orientation, 
amplification of the sequences flanked by the repeats occtars. 
One can further suppress production of repetitive sequences in 
5 such a PGR procedure by first hybridizing complementary 
sequences to said repetitive sequences wherein said 
complementary sequences have extended non-complementary 
flanking ends or are terminated in nucleotides which do not 
permit extension by the polymerase. The non-complementary 

10 ends of the blocking sequences prevent the blocking sequences 
from acting as a PGR primer during the PGR process. Primers 
directed against the Ala and repetitive DNA families have 
allowed the selective amplification of human sequences by 
interspersed repetitive sequence PCR (IRS-PCR) [Nelson et al. , 

15 pyAS, 86: 6686 (1989); Ledbetter et al.. Genomics. 6 > 475 
(1990)]. 

Archived Material 

An important aspect of this invention is that 

20 nucleic acids from archived tissue specimens, for exan^le, 
paraffin-embedded or formalin-fixed pathology specimens, can 
be tested by the methods of CGH. Said nucleic acid cannot, of 
course, be prepared into chromosome spreads for traditional 
cytogenetic chemical staining. Also, it is difficult for 

25 large enough restriction fragments to be extracted from such 
inaterial for other conventional research tools, such as 
Southern analysis. However, the nucleic acid from such 
specimens can be extracted by known techniques such as those 
described in Greer et al., Anatomic Pa tholoav, ^^ O}, 117-124 

30 (1991) and Dubeau et al., Cancer Res,, d6; 2964-2969 (1986), 
and if necessary, amplified for testing by various CGH 
methods. Such nucleic acid can be amplified by using a 
polymerase chain reaction (PCR) procedure (described above) , 
for example, by the method described in Greer et al., supra 

35 wherein DNA from paraffin-embedded tissues is amplif i d by 
PGR. 

A particular value of testing such archived nucleic 
acid is that such specimens are usually key d to the medical 
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records of the patients from wh m the .specimens were tedcen. 
Therefore, valuable diagnostic/prognostic associations can be 
made between the revealed cytogenetic state of patients' 
nucleic acid material and the medical histories of treatment 
5 and outcome for those patients. For example, information 
gathered by C6H can be used to predict the invasiveness of a 
tumor based upon its amplification and/or deletion pattern 
matched to associations made with similar patterns of patients 
whose outcomes are Icnown. 

10 Analogously, other nucleic acid that is fixed by 

some method, as, for example, archaelogical material preserved 
through natural fixation processes, can also be studied by CGH 
procedures. As indicated above, copy number differences 
between species provide information on the degree of 

15 similarity and divergence of the species studied. 

Evolutionarily important linkages and disjunctions between and 
euQong species, extant or extinct, cam be made by using the 
methods of CGH. 

20 Tumor Cytogenetics 

CGH provides the means to assess the association 
between gene amplification and/or deletion and the extent of 
t\imor evolution. Correlation between amplification and/or 
deletion and stage or grade of a cancer may be prognostlcally 

25 important because such information may contribute to the 
definition of a genetically based tumor grade that would 
better predict the future course of disease with more advemced 
tumors having the worst prognosis. In addition, information 
about early eunplif icatlon and/or deletion events may be useful 

30 in associating those events as predictors of subsecpient 
disease progression. Gene amplification and deletions as 
defined by CGH to, for example, normal metaphase spreads 
(genomic site, intensity of the signal and/or differences in 
signal ratios, and number of different genomic sites at which 

35 the copy number differences occur) can be associated with 

other known parameters such as tumor grade, hlstol gy, Brd/Urd 
labeling index, hormonal status, nodal inv Ivement, tumor 
size^ siirvival duration and other tumor pr pertl s available 
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from epidemi logical and biostatistical studies. For example, 
ttimor DNA to be tested by CGH could include atypical 
hyperplasia, ductal carcinoma in situ , stage I-IXI cancer and 
metastatic lymph nodes in order to permit the identification 
5 of associations between amplifications and deletions and 
stage. 

The associations made may make possible effective 
therapeutic intervention. For example, consistently amplified 
regions may contain an over expressed gene, the product of 

10 which may be able to be attacked therapeutically (for example, 
' the growth factor receptor tyrosine kinase, pl85 • 

CGH hybridizations of nucleic acids from cells .of 
primary cancers that have metastasized to other sites can be 
used to identify amplification and/or deletion events that are 

X5 associated with drug resistamce. For example, the subject 
nucleic acids to be analysed could be selected so that 
approximately half are from patients whose metastatic disease 
responded to chemotherapy and half from patients whose tiunors 
did not respond. If gene amplification and/or deletion is a 

20 manifestation of karyotypic instability that allows rapid 
development of drug resistance, more amplification and/or 
deletion in primary tumors from chemoresist2mt patients than 
in tiimors in chemosensitive patients would be expected. For 
example, if anqplification of specific genes is responsible for 

25 the development of drug resistance, regions surrounding those 
genes would be expected to be amplified consistently in tumor 
cells from pletiral effusions of chemoresistant patients but 
not in the primary txmors. Discovery of associations between 
gene amplification emd/or deletion and the development of drug 

30 resistance may allow the identification of patients that will 
or will not benefit from adjuvant therapy. 

Once a new region of amplification or deletion has 
been discovered by CGH, it can be studied in more detail using 
chromosome-specific painting [Pinkel et al. , PNAS fUSA> . 85 ; 

33 9138-9142 (1988); £P Publication No. 430,402 (June 5, 1991)] 
with a collection of probes that span the amplified or deleted 
region. Probes to amplified regions will show more signals 
than centromeric signals from the same chromosome, wher as 
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probes to nonamplified regions will show approximately the 
same number of test and centromeric signals* For example, the 
amplified regions on 17q22-23 and 20qter (discussed as newly 
discovered regions of amplification in Example 1) show 
5 variability in size from tumor to tumor using CGH (the 17q22- 
23 region more markedly) ; it can be expected that the region 
containing the important gene(s) can be narrowed by mapping 
the regions of amplification in multiple tumors in more detail 
to find the portion that is amplified in all cases. Probes 
10 for those studies can be selected, for example from specific 
cosmid libreuries produced by the National LeUsoratory Gene 
Library Project and/ or from the National Institute of Health ' 
(NIH) genomic research projects • 

The c-erbB-2 oncogene, also referred to as HER-2 or 
15 HSlif encodes for a 185 kilodalton (Kd) protein. Studies have 
reported c-erbB-*2 gene amplification in humem mammary tumor 
cell lines. [Kraus et al., EMBO J. 6 ! 605-610 (1987); van de 
Vijver et al., Mol. Cell Biol-, it 2019-2023 (1987).] Also, 
c-erbB-2 gene amplification in himem breast cancer has been 
20 shown to be associated with disease behavior, and may be a 
predictor of clinical outcome. [Slamon et al.. Science. 23$ a 
177-182 (1987); Berger et al.. Cancer Res.. Aft= 1238-1243 
(1988); Zhou et al., Cancer Res,, 47s6i23-6i2S (1987); and 
Venter et al., tang^t, \\x 69-71 (1987)]. C-erbB-2 has also 
25 been shown to be amplified in ovarian cemcers. [Alitalo and 
Schwab, Advances in Cancer Res., 47 ! 235-281 (1986).] 

C-nvG is a proto-oncogene which is the cellular 
homolog of the transforming gene of the chicken retrovirus 
HC29. In humans, c-mvc lies on the long arm of chromosome 8, 
30 at band 124, and spans about 5 kilobase pairs. The myc 
protein is a phosphoprotein present in the nucleus. The 
normal function of c-myc is unknown; however, it also 
certainly plays a role in cell division, and is expressed in 
normally growing cells as well as in tumor cells. It is now 
35 widely believed that translocati ns involving c-mvc lead to 
altered transcription of the gene, c ntributing to malignant 
transformation. 
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Sequences from N- myc member of the myc gene family 
have been shown to be amplified as much as a thousandfold in 
some neuroblastomas. N- mvc amplifications are usually seen in 
the later stage III eind IV tumors. Some small-^cell lung 
5 carcinomas also have amplified myc genes in double minute 
chromosomes (DKs) and homogeneously staining regions (HSRs) . 
Myc has also been shown to be amplified in colon cancer. 
[Alitalo and Schwab, supra . ] Again such amplifications axB 
found in late stages of tumor development, in the so^-called 

10 vcuriant cells that exhibit a more malignant behavior. 

Amplifications can involve either c-mvc . N- mvc or another 
member of the mvc gene family, L- myc . [Watson et al., supra 
at pp. 1084-1086]. 

In addition, overexpression has been observed for 

15 the p-glycoprotein gene family associated with multi-drug 
resistance and for drug metabolizing enzymes such as P450 
containing enzymes and glutathione S-transf erase. [Fair child 
and Cowan, J. Radiation Oncol, Btol, Phvs,. 20; 361-367 
(1990).] 

20 Identification of aiq>lified and/ or deleted genes is 

important to the management of cancer^ for example, breast 
camcer, for several reasons: 

1) to improve prognostication; 

2} to detect amplification and/or deletion events 
25 that are associated with the development of drug resistance; 
and 

3) to improve therapy. 
For example, in regaurd to improving prognostication, in breast 
cancer the amplification of oncogenes, such as int -2 . erbB-2 

30 and mvc occur frequently and have been associated with 

aggressive growth and poor prognosis in some studies. [Schwab 
and Amier, Genes, cairomosomes & Cancer. 1 ; 181-193 (1990).] 
In regard to reason (2), gene as^lification has cleeurly been 
shown to lead to drug resistance in vitro (for example, 

35 eunplif ication of the dihydrofolate reductase gene c nf rs 
resistance to methotr xat ) , and is likely t occur in 
patients xindergoing th rapy as well (for example, as a result 
of over expr ssion of glutathion S-trsuisf rase and p- 
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glycoprotein). [Fairchild and C wan, suorg ] . Thus, the 
identification of resistance-blinked genes would have a major 
impact on therapy by allowing therapy modification as 
resistance-related gene amplification occurs. Therapy could 
5 be improved by targeting for specific therapy, tumors that 
overexpress specific amplified genes, 

Prenatal pia^qposj,^ 

Prenatal screening for disease-linked chromosome 
10 aberrations (e.g., trisomy 21) is enhanced by the rapid 

detection of such abberrations by the methods and compositions 
of this invention. CGH analysis is paurticularly significant 
for prenatal diagnosis in that it yields more rapid results 
than are available by cell culture methods* 

15 

Removal of Repetitive Sequences and/or Dlsablinc^ i^he 
Hybridization Capacity of Repetitive Seouenc^g 

The following methods can be used to remove 
repetitive sequences and/or diseible the hybridization capacity 

20 of such repetitive sequences. Such methods are representative 
and are expressed schematically in terms of procedures well 
known to those of ordinary skill the art, and which can be 
modified and extended according to parameters and procedures 
well known to those in the art. 

25 Bulk Procedures. In many genomes, such as the human 

genome, a major portion of distributed (or shared) repetitive 
DHA is contained in a few families of highly repeated 
sequences such as Alu. These methods primarily exploit the 
fact that the hybridization rate of complementaury nucleic acid 

30 strands Increases as their concentration increases. Thus, if 
a mixture of nucleic acid fragments is denatured and incubated 
under conditions that permit hybridization, the sequences 
present at high concentration will become double-stranded more 
rapidly than the others. The double-str€unded nucleic acid can 

35 then be rem ved and the remainder used in the hybridizati ns. 
Alternatively, the partially hybridized mixture can be used as 
the subj ct nucleic acid, th double-stranded sequences being 
unable to bind to the target. The f 1 lowing are methods 
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representative of bulk procedures that are useful for 
disabling the hybridization capacity of repetitive sequences 
or removing those sequences from a mixture. 

Self -reassociation > Double-stranded nucleic acid in 
5 the hybridization mixture is denatured and then incubated 
under hybridization conditions for a time sufficient for the 
high*copy sequences in the mixttire to become substantially 
doxible-stranded. The hybridization mixture is then applied to 
the reference chromosome spread. The remaining labeled 

10 single-stranded copies of the highly repeated sequences may 
bind throughout the reference chromosome spread producing a 
weak, widely distributed signal. 

Use of blocking n ucleic acid . Unlabeled nucleic 
acid sequences which are complementary to those sequences in 

15 the hybridization mixture whose hybridization capacity it is 
desired to inhibit are added to the hybridization mixture. 
The subject nucleic acids and blocking nucleic acid are 
denatured, if necessary, and incxibated under appropriate 
hybridization conditions. The sequences to be blocked become 

20 double-stranded more rapidly than the others, and therefore 
are uneUsle to bind to the reference spread when the 
hybridization mixture is applied to the spread. In some 
cases, the blocking reaction occurs so quickly that the 
incubation period can be very shorty and adequate results can 

25 be obtained if the hybridization mix is applied to the spread 
immediately after denattiration. Further, the probe and the 
target can be simultaneously denatured in some cases. A 
blocking method is generally described in the context of 
Southern analysis by Sealy et al., "Removal of Repeat 

30 Sequences form Hybridization Probes", Nucleic Acid Research , 
13:1905 (1985). Examples of blocking nucleic acids include 
genomic DNA, a high-copy fraction of genomic DNA and 
particular sequences as outlined below. 

i. Genomic DNA . Genomic DNA contains all of the 

35 nucleic acid sequences of the organism in proportion to their 
copy-number in the genome. Thus, adding genomic DNA to the 
hybridization mixture increas s the concentration of the 
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high-copy repeat sequences more than low-copy sequenc s, and 
therefore is more effective at blocking the former. 

ii. Hiah-coDV fraction of genomic DNA. 
Fractionating the genomic DNA to obtain only the high-copy 
3 sequences and using them for blocking can be done, for 
example, with hydroxyapatite as described below « 

Removal of Sequences. 

Hvdroxvaoat ite . Single- and double-stranded nucleic 

10 acids have different binding characteristics to 

hydroxyapatite. Such chetracteristics provide a basis commonly 
used for fractionating nucleic acids. Hydroxyapatite is 
commerically availcQ)le [e.g., BioRad Laboratories, Hercules, 
CA (USA) ] • The fraction of q :inomic DNA containing sequences 

15 with a particular degree of ^ petition, from the highest 
copy-ntimber to single-copy, cun be obtained by denaturing 
genomic DNA, allowing it to reassociate under appropriate 
conditions to a particular value of C^t, followed by 
separation using hydroxyapatite. The single- eutd 

20 double-stranded nucleic acid can also be discriminated by tise 
of SI nuclease. Such techniques and the concept of C^t are 
explained in Britten et al., Analysis of Repeating DNA 
Sequences by Reassociation" , in Methods in Enzvmolocrv. 29 ; 
363-418 (1974). 

25 p^ac^jpn yitfr IfflBPblltgya nwlgjiff agid* Removal of 

particular sequences can also be accomplished by attaching 
single-stranded "absorbing'' nucleic acid sequences to a solid 
support. Single-stranded source nucleic acid is hybridized t 
the immobilized nucleic acid. After the hybridization, the 

30 unbound sequences are collected and used in CGB. For example, 
human genomic DNA can be used to aUssorb repetitive sequences 
from the subject nucleic acids. One such method is described 
by Brison et al., '*(»eneral Method for Cloning Amplified DNA by 
Differential Screening with Genomic Probes," Molecular and 

35 Cellular Blolocfv. 2 ; 578-587 (1982). Bri fly, minimally 

sheared human genomic DNA is bound to diazonium cellul se or a 
like support. The sourc DNA, appr priat ly cut into 
fragments, is hybridiz d against the immobilized DNA to C^t 
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values in the range of about l to 100. The preferred 
stringency of the hybridization conditions may vary depending 
on the base composition of the DNA. 

Prehvbridization^ Blocking of repeat sequence 
5 binding sites in the refere:ice genome by hybridization with 
unlabeled complementary sequences will prevent binding of 
labeled sequences in the sxibject nucleic acids that have the 
potential to bind to those sites. For example, hybridization 
with unlabeled genomic DNA will render the high-copy 
10 repetitive sequences in the reference genome double-stranded. 
Labeled copies of such sequences in the subject nucleic acids 
will not be able to bind when they are siibsec[uently applied. 

In practice, several mechemisms Ccm be combined to 
produce the desired contrast euid sensitivity. 

15 

I,abelina the Nucleic Acid Pra gments of the Sub-i^ei. 
yucle;LC Acid^ 

There are many techniques available for labeling 
single- and dotible-streuided nucleic acid fragments of the 

20 subject nucleic acids. They include incorporation of 
radioactive labels, e.g. Harper et al. Chromosoiita , ^i ; 
431-439 (1984); direct attachment of f luorochromes or enzymes, 
e.g. Smith et al., Nuc. Acids Res>. T3> 2399-2412 (1985), and 
Connolly et al., Nuc. Acids Res,. 13* 4485-4502 (1985); and 

25 various chemical modifications of the nucleic acid fragments 
that render them detectable immxinochemically or by other 
affinity reactions, e.g. Tchen et al., "Chemically Modified 
Nucleic Acids as Immunodetectable Probes in Hybridization 
Experiments," PNAg, 9Xi 3466-3470 (1984); Richardson et al., 

30 "Biotin and Fluorescent Labeling of RHA Using T4 RNA Ligase," 
Nuc. Acids Res. , 11; 6167-6184 (1983) ,v Langer et al., 
"Enzymatic Synthesis of Biotin-Labeled Polynucleotides: Novel 
Nucleic Acid Affinity Probes," PNAS. 78 : 6633-6637 (1981); 
Brigati et al., "Detection of Viral Genomes in Cultured Cells 

35 and Paraffin-Embedded Tissue Sections Using Biotin-Labeled 
Hybridization Probes," Virol,, 12 ^> 32-50 (1983); Brok r et 
al., "Electron Microscopic Visualization of tRNA Genes with 
F rritin-Avidih: Biotin Labels," Nuc. Acids Res,, s ; 363-384 
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(1978); Bayer et al., «The Use of the Avidin Biotin Complex as 
a Tool in Moleculeor Biology," Methods of Biochem. Analysis. 

1-45 (1980) ; Kuhlmann, Immunoenzyiae Techniques ±r} 
Cytochemistry (Weinheim, Basel, 1984). Langer-Safer et al., 
5 PWAS (USA), 22: 4381 (1982): Landegent et al., Exp. Cell Res .^ 
153: 61 (1984); and Hopman et al., Exp. Cell Res . . 169 ; 357 
(1987). Thus, as indicated, a wide yariety of direct and/ or 
indirect means are availcd^le to enable yisualization of the 
subject nucleic sequences that have hybridized to the 
10 reference genome. Suitable visualizing means include various 
ligamds, radionuclides, f luorochromes and other fluorescers, 
chemiliaainescers, enzyme substates or co-factors, particles, 
dyes and the like. Some preferred exemplary labeling means 
include those wherein the probe fragments are biotiny lated , 
15 modified with N-acetoxy-N-2-acetylaminof luorene, modified with 
fluorescein isothiocyanate or other f luorochromes, modified 
with mercury /TNP ligand, sulfonated, digoxigeninated or 
contain T-T dimers. 

A preferred method of labeling is tailing by 
20 terminal transferase labeling. Another preferred method is 
ramdom priming with mixed sequence primers followed by 
polymerase extension. This has the additional feature of 
amplifying the amount of subject DNA, if several cycles are 
used, which is useful when only a small amount of DNA was 
25 originally obtained from the subject cell or cell population. 

The key feature of leUdeling is that the subject 
nucleic acid fragments boiind to the reference spread be 
detectable. In some cases, an intrinsic feature of the 
subject nucleic acid, rather than an added feature, can be 
30 exploited for this purpose. For example, antibodies that 

specifically recognize RNA/DNA duplexes have been demonstrated 
to have the ability to recognize probes made from RNA that are 
bound to DNA targets [Rudkin and Stollar, Nature . 265:472-473 
(1977)]. The RNA used is unmodified. Nucleic acid fragments 
35 can be extended by adding "tails" of modified nucleotides or 
particular normal nucleotides. When a n rmal nucle tide tail 
is used, a second hybridization with nucleic acid 
complementary to the tail and containing f luorochromes. 
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enzymes, radi activity, modified bases, among other labeling 
means, allows detection of the bound nucleic acid fragments. 
Such a system is commercially available from Enzo Biochem 
[Biobridge Labeling System; Enzo Biochem Inc., New York, 
5 N.Y. (USA)]. 

Another example of a means to visualize the bound 
nucleic acid fragments wherein the nucleic acid sequences do 
not directly carry some modified constituent is the use of 
antibodies to thymidine dimers. Nakane et al., ACTA 

10 Histochem. Cytochem. . 20 (2) :229 (1987), illustrate such a 
method wherein thymine-thymine dimerized DNA (T-T DNA) was 
used as a marker for in situ hybridization. The hybridized 
T-T DNA was detected immunohistochemically using rabbit 
anti-T-T DNA antibody. 

15 All of the labeling techniques disclosed in the 

abovB references may be preferred under peurticular 
circumstances. Further, any leU^eling techniques known to 
those in the art would be useful to label the subject nucleic 
acids in of this invention. Several factors govern the choice 

20 of Isibeling means, including the effect of the label on the 
rate of hybridization and binding of the nucleic acid 
fragments to the chromosomal DNA, the accessibility of the 
bound nucleic acid fragments to labeling moieties applied 
after initial hybridization, the mutual compatibility of the 

25 leU3eling moieties, the nature and intensity of the signal 
generated by the label, the expense and ease in which the 
label is applied, and the like. 

Several different subject nucleic acids, each 
labeled by a different method, can be used simultaneously. 

30 The binding of different nucleic acids can thereby be 
distinguished, for exao^le, by different colors. 

In Situ Hvbridization. 

Application of the subject nxicleic acids to the 
35 reference chromosome spreads is accomplished by stand2a:d in 
situ hybridization techniques. Several excellent guides to 
the technique are availabl , e.g.. Gall and Pardue, **Nucleic 
Acid Hybridization in Cytological Preparations," Methods in 
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Enzvmoloav. 21: 470-480 (1981); Henderson, "Cytological 
Hybridization to Mamaalian Chromosomes," International Rev;jetj 
of Cvtoloov. 76: 1-46 (1982); and Angerer et al. , " in situ 
Hybridization to Cellular RNAs," in Genetic Engineering : 
^ Pyj.nciples ap4 t^etftoc^g, Setlow and Hollaender, Eds., Vol, 7, 
pgs. 43-65 (Plenum Press, New York, 1985). 

Generally in situ hybridization comprises the 
following major steps: (i) fixation of tissue or biological 
structure to be examined, (2) prehybridization treatment of 

10 the biological structure to increase accessibility of target 
DNA, and to reduce nonspecific binding, (3) hybridization of 
the mixture of nucleic acids to the nucleic acid in the 
biological structure or tissue; (4) posthybridization washes 
to remove nucleic acid fragments not bound in the 

15 hybridization and (5) detection of the hybridized nucleic acid 
fragments. The reagents used in each of these steps and their 
conditions of use vary depending on the particular situation. 

Under the conditions of hybridization wherein human 
genomic DNA is used as an agent to block the hybridization 

20 capacity of the repetitive sequences, the preferred size range 
of the nucleic acid fragments is from eUsout 200 bases to aibout 
1000 bases, more preferably about 400 to 800 bases for double- 
stranded, nick- translated nucleic acids €uid about 200 to 600 
bases for single-stranded or PGR adapter-linker amplified 

25 nucleic acids. 

Example 1 provides details of a preferred 
hybridization protocol. Basically the same hybridization 
protocols as used for chromosome-specific painting as 
described in Pinkel et al., PNAS rnsA^ , as: 9138-9142 (1988) 

30 and in EP Pub. No. 430,402 (published June 5, 1991) are 
adapted for use in CGH. 

The following representative examples of 
performing CGH methods of this invention are for purposes of 
illustration only and are not meant to limit the invention in 

35 any way. 
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py^pl-e 1 

DNA froTa Pyeast Cancer Lines 
Hybridized to Normal Metaphase Spreads 

In this Example, methods of this invention to 
5 analyse genomes by Comparative Genomic Hybridization (CGH) are 
exen^lified by hybridizations of breast cemcer cell lines to 
normal metaphase spreads. The teurget metaphase spreads were 
pre-hybridized with iinlabeled human placental DNA to block the 
high copy repeat sequences. In this representative example, 
1Q the hybridization mixture containing the extracted l2j3eled DNA 
• from the cell lines contained unlabeled, repeat-enriched Cot-l 
blocking DNA [obtained from Bethesda Research Laboratories 
(BRL), Gaithersburg, MD (USA]. 

The experiments outlined below include in the 
hybridization mixture for the subject genomes, that is, the 
breast cancer cell line DNAs, chromosome-specific repeat 
sequence probes and chromosome-specific painting probes. 
Those probes leOseled with biotin were included aa an adjunct 
for identifying chromosomes in the metaphase preparations. 
20 The experiments were first performed without those chromosome- 
specific probes. Then each chromosome of interest was 
measured to deteinnine its length which was considered along 
with other factors to determine its probable identity. The 
chromosome-specific probes were then used in the hybridization 
25 mixture to confirm the identity of the chromosome of interest. 
However, such probes are not necessary as the chromosomes 
could have been identified by the DAPI banding of the 
counterstain or by other chemical staining, such as staining 
with quinacrine, by a skilled cytogeneticist. 

30 

cell Lines and Isolation of DNA! 

Six established breast cancer cell lines: BT- 
474, SK-BR-3, MCF-7, MDA-MB-361, MDA-MB-468 and T-47D were 
obtained from the American Type Culture Collection [Rockville, 
35 Msuryland (USA) ] . The breast cancer cell line 600MPE cell line 
was kindly provided by Dr. Helene S. Smith [Geraldine Brush 
Cancer Research Center, San Francisco, CA (USA)]. C 11 lin s 
were grown until they became confluent. Cells were then 
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trypsinized, pelleted by centrifugation at 1500 RPM for 5 
minutes and washed twice in phosphate buffered saline. The 
DNA was then isolated as described by Sambrook et al., 
Molecular Cloning: A Lab oratory Manual . Vol. 2: 9 .16*9 ,19 
[Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
(USA) 1989]. 

Details concerning the established human breast 
cancer cell lines used herein are as follows: 

BT-474 - Originated from a human primary cancer; 

obtained from the ATCC, catalog # HTB 20; 



SK-BR-3 - Originated from a human metastatic breast 

adenocarcinoma derived from a pleural 
effusion; obtained from the ATCC catalog # 
15 HTB 30; 

MDA-MB-361 - Originated as a metastatic tumor to the 

brain; obtained from the ATCC, catalog # HTB 
27; 

20 

MCF-7 - Originated from a human metastatic pleural 

effusion; obtained from the ATCC, catalog # 
HTB 22; 

25 T-47D - Originated as a hxman metastatic pleural 

effusion; obtained from the ATCC catalog # 
HTB 133; 

600MPE - Originated as a human metastatic pleural 

30 effusion; kindly provided by Dr. Helene S. 

Smith [Geraldine Brush Cancer Research 
Center, San Francisco, CA (USA)]; and 

MDA-MB-468, - Originated as a metastatic pleural effusion; 
35 obtained from the ATCC, catalog # HTB 132. 

Preparation of Normal Lvmphocvte Metaphases ; 

Normal peripheral blood lys^hocytes were stimulated 
by PHA, synchronized by methotrexate treatment and blocked in 
metaphase using 0*05 ug/ml colcemid. Cells were then 
centrifuged, washed and incubated in 75 mM KCl at 37®C for 15 
minutes. Cells were then fixed in methanol: acetic acid (3:1) 
and dropped onto slides. The slides were stored under 
nitrogen at -20^C. 
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DNA Labeling : 

Cell line DNAs were led>eled with digoxigenin**ll-clUTP 
using nick translation [Rigby et al. , J. Mol. Biol , , 113 : 237 
(1977); Sambrook et al.^ supra 1 . The optimal size of the 
5 probe fragments after nick translation and before denatxiring 
was 400-800 bps. As indicated above, chromosome-specific 
probes were used in dual-color hybridizations to verify the 
identification of chromosomes of interest in the metaphase 
spreads. Representative examples of such chromosome-specific 
10 reference probes labeled with biotin-14-dATP include the 
following: 

1) a chromosome-specific painting probe for 
chromosome 20 prepared by the PGR adapter-linker method as 
described in PCT/US90/00434 published August 9, 1990; 

13 2) a chromosome 17 peri-centromeric repeat probe 

(cosmid CK17.10) isolated by Anne Kallioniemi from a 
chromosome 17 cosmid library from Los Alamos National 
Laboratory [Albuquerque, New Mexico (USA) J; an equivalent 
chromosome-specific repeat sequence probe for chromosome 17 is 

20 commercially available from Oncor [Gaithersburg, HD (USA) ] ; 
and 

3) an alpha satellite repeat probe specific for the 
centromeric region of chromosome 8 [kindly provided by Dr. 
Heinz-Ulrich G. Weier; University of California Medical 
25 Center^ LaOs for Cell Analysis, San Francisco, CA (USA)]; that 
probe was generated by Dr. Weier using PCR with primers WAl 
and WA2 as described in Weier et al, , Hum, Genets. 87 : 489- 
494 (1991). 

Ones skilled in the art recognize that there are 
30 many other equivalent probes available that could be used for 
the confirmation pxirposes described* For example, whole 
chromosome painting probes are now available for all the hixman 
chromosomes [Collins et al.. Genomics, li t 997-1006 (1991)]. 
Also available are repeat sequence probes that hybridize 
35 intensely and specifically to selected chromosomes [Trask t 
al.. Him- Genet.. 78 i 251 (1988) and references cited 
therein] • 
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Pretreatment and Prehvbridi2ation of Slides; 

Lymphocyte metaphase preparations were first 
denatured in 70% f ormamide/2XSSC (IXSSC is 0.15 M NaCl, 0.015 
M NaCitrate), pH 7, at 70^C for 2 minutes and dehydrated in a 
5 sequence of 70%, 85% and 100% ethanol. The slides were then 
air dried and treated with 10 ug/50 ml Proteinase K 
[Boehringer Mannheim GmbH, Indianapolis IN (USA)] for 7.5 
minutes at 37^C in a buffer containing 20 mM Tris and 2 mM 
CaClj (pH 7.5). Ethanol dehydration was then done as 

10 described above, and the slides were prehybridized with ten ul 
of a hybridization mixture, consisting of 20 ug unlabeled 
human placental DNA [obtained from Sigma, St. Louis, MO (USA) ; 
size of the fragments is 200-700 bps] in 50% formeunide, 10% 
dextran sulphate and 2XSSC (pH 7) for 60 minutes at 37°C. 

15 Before the prehybridization mixture was applied to the slides, 
it was denatured in a 70°C water bath for 5 minutes. After 
prehybridization, the slides were washed once in 2XSSC and 
dehydrated with ethanol as described above. 

20 qyj^r^d^^^tApp: 

Five ug of xinlabeled, repeat-enriched Cot-1 blocking 
DNA [BRL, Gaithersburg, MD (USA)] and 60 ng of digoxigenin 
labeled cell line DMA and 20-60 ng of blotin-labeled reference 
probes (for verification of chromosome identification) were 

25 mixed together and 1/10 vol of 3M Na-acetate was added. DMA 
was precipitated by adding 2 volumes of 100% ethanol followed 
by centrifugation in a microcentrifuge for 30 minutes at 
15,000 RPM. Ethanol was removed and the tubes were allowed to 
dry until all visible ethanol had evaporated. Ten ul of 

30 hybridization buffer consisting of 50% formamide, 10% dextran 
sulphate and 2XSSC (pH 7) was then added, followed by careful 
mixing. DMAs in the hybridization buffer were then denatured 
for 5 minutes at 70®C followed by a 60 minute renaturation at 
37®C. The hybridization mixture was then added to the 

35 prehybridized lymphocyte metaphas slides. Hybridization was 
carried out tinder a coverslip in a moist chamber for 3-4 days 
at 37®C. 
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Xmmunofluorescent Probe Detection: 

The slides were washed three times in 50% forxnaiaide/ 
2XSSC, pH 7, twice in 2XSSC and once in O.IXSSC for 10 minutes 
each at 45®C. After washing, the slides were 
5 immxinocytochemically stained at room temperature in three 
steps (30-45 minutes each) • Before the first 
immunocytochemical staining, the slides were preblocked in 1% 
BSA/4XSSC for 5 minutes. The first staining step consisted of 
2 ug/ml Texas Red-Avidin [Vector Laboratories, Inc., 

10 Burlingame, CA (USA) ] in 1% BSA/4XSSC. The slides were then 
washed in 4XSSC, 4XSSC/0.1% Triton X-100, 4XSSC, and PN (a 
mixture of O.l M NaH2P04 and 0.1 M Na2HP04, pH 8, and 0.1% 
Konidet P-40) for 10 minutes each and preblocked with PNM (5% 
Carnation dry milk^ 0.02% Na-azide in PN buffer) for 5 

15 minutes. The second antibody incubation consisted of 2 ug/ml 
FITC-conjugated sheep anti*digoxigenin [Boehringer Memnheim 
GmBH, Indianapolis, IN (USA)] and 5 ug/ml anti-avidin [Vector 
Laboratories, Burlingame, CA (USA)] in PNM followed by three 
PN washes, 10 minutes each. After the PNM block, the third 

20 immunochemical staining was done using rabbit euiti-sheep FITC 
antibody (1:50 dilution) (Vector Laboratories) and 2 ug/ml 
Texas Red-Avidin in PNM. After three PN washes, nuclei were 
counterstained with 0.8 xiM 4,5-diamidino-2-phenylindole (DAPI) 
in an antifade solution. 

25 

Fluorescence M icroscopy and Interpretation of Results; 

A NDcon fluorescence microscope [Nikon Inc., Garden 
City, NY (USA) ] equipped with a double band pass filter 
[Chroma Technology, Brattleboro, VT (USA)] and a lOOX 

30 objective was used for simultaneous visualization of the FITC 
and Texas Red signals. Hybridization of the breast cancer 
cell line DMAs was seen as a more or less uniform faint green 
background staining of all metaphase chromosomes with the 
exception of the Y-chromosome. As the breast cancer cell 

35 lines are of course of femal origin, th y did not contain Y 
chromosomal DNA. The ctbs nc of said green staining of the Y 
chromosome of the metaphase spread is xemplary of the mann r 
in which a cytog netically significant deleti n would be 
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visualized. The absence of the Y chromos me in the breast 
cancer cell line DNA was detected, as would a cytogenetically 
significant deletion, by the hybridization wherein the Y 
chromosome of the reference spread was only stained by the 
5 LAPl counterstain. Using a fluorescence microscope, amplifi d 
sequences can be seen as bright green dots or bands along the 

chromosome eurms. 

To facilitate the display of the results and to 
improve the sensitivity of detecting small differences in 

10 fluorescence intensity, a digital image analysis system 

(QUIPS) was used. QUIPS (an acronym for quantitative image 
processing system) is an automated image analysis system based 
on a standard Nikon Microphot SA [Nikon Inc. , Garden City, NY 
(USA)] fluorescence microscope equipped with an automated 

15 stage, focus control and filterwheel [Ludl Electronic Products 
Ltd., Hawthorne, NY (USA)l. The filterwheel is mounted in th 
fluorescence excitation path of the microscope for selection 
of the excitation wavelength. Special filters [Chroma 
Technology, Brattleboro, VT (USA)] in the diohroic block all w 

20 excitation of multiple dyes without image registration shift. 
The microscope has two camera ports, one of which has an 
intensified CCD camera [Quantex Cojnp. , Sunnyvale, CA (USA)] 
for sensitive high-speed video image display which is used for 
finding interesting areas on a slide as well as for focusing. 

25 The other camera port has a cooled CCD camera [model 200 by 
Photometries Ltd. , Tucson, AZ (USA) ] which is used for the 
actual image acquisition at high resolution and sensitivity. 

The cooled CCD camera is interfaced to a SUN 4/330 
workstation [SUN Microsystems Inc., Mountain View, CA (USA)] 

30 through a VME bus. The entire acquisition of multicolor 
images is controlled using an image processing software 
package SCIL- Image [Delft Centre for Image Processing, Delft, 
Netherlands]. Other options for controlling the cameras, 
stage, focus and filterwheel as well as special programs for 

35 the acquisition and display of multic 1 r images were 

d veloped at the Division of Molecular Cytometry [University 
of California, M dical Center; San Francisc , CA (USA)] based 
on the SCIL-Image package. 
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To display the results of the comparative 
'^Z^T.T' consecutive images were acquired 

(DAPI PiTC and Texas Red) and superimposed. The FlTC^age 

dx-Played after using the thresholding and contrast 
enhancement options of the SCIL-lmage software. Ebcercisinc 
such options reduces the overall chromosomal fluorescence to 
make amplified sequences more readily visible. For example 
usxng thresholding and contrast stretching, it was possible'to 
enhance the contrast and quantification between the fai^t 

originating from the 
amplified sequences in the cell lines. Alternatively to 
facilitate the detection of deletions, it is possible to 
increase the overall chromosomal fluorescence and maJca areas 
15 ll;^:^ ''^^^-^^^^ ^^P^^ -^^^r. The red color was used 
15 for reference probes to help in the identification of 
chromosomes . 

Aft«r identification of th. chro«,somes bisecl on th. 
.-e Of refar»ce probes in . dual-color hybridization, a siS 
Of amplification was localized by fractional len^ 
20 measurements along the chroM»o« am (fractional length = 
distance of th. hybridlation sig^ii fro. the p-teiomC. 
d.v.ded by the total l«„th of th. chro«so«,. The band 
location Of the signal «a, then approximated fro. the 

WorangjatnTf , Karg«r a,, Basel, Switzerlaiid^ (1985) ] . 

,„ , K, *^ hybridizations are compil«l i„ 

30 tabl. 2 along „ith oth„ infor»tio„ Xno™ about th. 
lin«i. ».pUfiction at I7qu (.rbB-2 locus) and 

^^il?"'' T «" »«» ih lin« Showing 

ain>lificatio„ Of .rbB-2 and Wfc whMi.v« th. l«.i of 

amplification was grMter than about five- to ten-fold usino 

35 this CSS method, in addition, amplification Of ™.v «r^ 

^"^T'^r""" «t l„22-„ 

«d in thr.. lines at 20,t«:; thos. amplifications were 
previously untaown slt« of amplification and were not 
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expected from other studies. F r example, as indicated in 
Table 2, the BT-474 cell line is known to have a 13-fold c- 
erbB-2 amplification; C6H revealed amplified sequences at the 
following loci: 17ql2 (the erbB-2 locus); 17q22-q23 and 
20ql3-ter. The latter two sites were previously unrecognized 
sites of amplification in that cell line. 

All lines showing amplification showed cunplif ication 
at more them one site. Evidence for co-amplification may be 
clinically important since co-amplification has been observed 
previously [van de Vijver et al., Mol. Cell Biol. 7 : 2019- 
2023 (1987); Saint-Ruf et al.. Oncogene. 6 ; 403-406 (1991)], 
and is sometimes associated with poor prognosis [Borg et al., 
Br. J. Cancer. 63: 136-142 (1991)]. Amplification at 17q22- 
23 has also been seen using probe DNA from primary tumors. 

Results of Testing Breast Cancer 
Cell Lines for Amplification 



20 Cell 
Line 



25 



30 



35 



40 



Origin 



Known Amplifi- 
6ro%ith Hormone amplifi- cation 
rate; receptor cation detected 
-Td E/P (level) by CSH 



BT-474 Primary 
Cancer 



SK-BR-3 PI. Effusion 



48-96 hr +/- 



HDA-MB- Brain met. 
361 

MCF-7 PI. Effusion 



T-47D PI. Effusion 
600MPE PI. Effusion 



45 MDA-MB- Pl. Effusion 
468 



<96 hr 
<48 hr 



-/+ 
+/+ 

+/+ 



erbB-2 
(13X) 



erbB-2 
(9X) 

MYC(IOX) 



erbB-2 
(4X) 

erbB-2 
(none) 

erbB-2 
(none) 

erbB-2 
(n ne) 

erbB-2 
(none) 



17ql2 
(erbB-2) , 
17q22-23, 
20qter 

17ql2 
(erbB-2) , 
8q21, 
8q23-24.1 
(MYC), 20qter 

17q22-23 



17q22-23, 
20qter 

None 



N ne 



N ne 
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ample g 

Hybridizations with two different labeled subject 
DNAs as schematically outlined in Figures 5 and 6 were 
performed* One of the labeled subject DNAs hybridized was a 
5 cell line DNA as described in ^[xample 1 and similarly labeled. 
The other labeled subject DNA was human genomic DNA labeled 
with biotin-14-dATP. 

The protocols were essentially the same as in 
Example 1 except that no chromosome-specific reference probes 
10 were used, and the same amoxint of the lcQ)eled hximan DNA as the 
labeled cell line DNA, that is, 60 ng, was hybridized. Of 
coxirse, reference probes could be added to the hybridiz$ition 
mixture, but they need to be differently labeled to be 
distinguishable . 

15 The results showed the normal DNA with a red signal 

and the cell line DNA with a green signal. The green to red 
ratios were determined along each chromosome. Amplification 
was indicated by an area where the signal was predoioinemtly 
green whereas deletions were indicated by more red signals 

20 than in other cureas of the chromosomes. 

Exemplary, CGH results using breast cancer cell line 
600MPE DNA and normal human DNA were as follows. As indicated 
above, the hybridization was performed using 5 ug Cot-l DNA, 
60 ng of digoxigenin labeled 600MPE cell line DNA^ and 60 ng 

25 of biotinylated normal human genomic DNA. The 600MPE DNA was 
detected with FITC (green) and the genomic DNA with Texas Red- 
Avidin (red). 

The 600MPE breast cancer cell line, the karyotype 
for which was published by Smith et al. , JNCI. 78 r 611-615 

30 (1987), contains one normal chromosome 1 euid three marker 
chromosomes with chromosome l material in them: t(lq:i3q), 
lp(p22) and inv(l) (p36q21) . Thus, the cell line is disomic 
for the p-telomere-p22, trisomic for p22 -centromere and 
tetrasomic for the q-arm of chromosome 1. 

35 The comparative genomic hybridizations of this 

example apparently identified three different r gi ns on 
chromosome 1 that could be s parated according to the 
intensities of gre n and red colors. The q-arm of chromos me 
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1 had the high st intensity of green color (tmnor DNA) . The 
region from band p22 to the centromere was the second 
brightest in green, and the area from the p-telomere to band 
p22 had the highest intensity of red color (normal DNA) • 
5 Those hybridization results were consistent with the 

traditional cytogenetic analyses of that cell line stated 
immediately above. 

However, fxirther studies with CGH, as presented in 
Example 3, indicated that the CGH analysis of this example, as 
10 well as the published keUT^otype, were partially in error. Th 
CGH analysis of Example 3 motivated additional confirmatory 
experiments, as described therein, leading to correction of . 
the original CGH results and the published karyotype. 



15 Example 3 

COPY Number Karvotvpea of Tumor DNA 
In the representative experiments of CGH in this 
example, biotinylated total tumor DNA (cell line and primary 
tumor DNA) and digoxigenin-labeled normal human genomic DNA 

20 are simultaneously hybridized to normal human metaphase 

spreads in the presence of unlabeled blocking DNA containing 
high-copy repetitive sequences, specifically unlabeled Cot-1 
blocking DNA [BRL, Gaithersburg, MD (USA) ] • The following 
paragraphs detail the procedures used for the representative 

25 CGH experiments of this example. 

DNAs used in this example were labeled essentially 
as shown above in Example 1. DNAs were labeled with biotin- 
30 14-dATP or digoxigenin-ll-dUTP by nick translation [Rigby et 
al., suEia; Sambrook et al,, supra ] , The optimal size for 
doxible stranded probe fragments after labeling was 600-1000 
bp. 

35 Pretreatment of Metaphase Spreads; 

Lymphocyte metaphase preparations were denatured, 
dehydrated and air dried, treated with Prot inase K and 
dehydrated again as d scribed in Example 1. 
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Comparative Genomic Hybridization: 

Sixty ng of biotinylated test DNA, 60 ng of 
digoxigenin-labeled normal DNA and 5 ng of unlabeled Cot-1 DNA 
(BRL) were ethanol precipitated and dissolved in 10 /il of 50% 
5 formamide, 10% dextran sulfate, 2xSSC, pH 7. The probe 
mixture was denatured at 70*^C for 5 minutes, allowed to 
reanneal at 37®C for 60 minutes and hybridized to normal male 
metaphase chromosomes for 3-4 days at 37^C. 

iO Immunof luorescent Probe Detection : 

The slides were washed as described above in Example 
1, and immunocytochemically stained at room temperature in 
three thirty-minute steps: (I) 5 ng/ml FITC-Avidin [Vector 
Laboratories, Inc., Biirlingame, CA (USA)] and 2 Mg/ml anti- 

13 digoxigenin-Rhodamine (Boehringer Mannheim GHbH) ; (II) 5 fig/ml 
anti-avidin (Vector Laboratories) ; and (III) 5 iig/ml FITC- 
avidin. Nuclei were cotmterstained with 0.8 (M 4 , 5-diamino-2- 
phenylindole (DAPI) in antifade solution. A Zeiss 
fluorescence microscope equipped with a double band pass 

20 filter [Chroma Technology, Brattleboro, VT (USA) ] was used for 
simultaneous visualization of FITC and rhodamine signals. 

Digital Image Analvsis Svstem and Fluorescence Ratio Profiles 
The QUIPS system essentially a described above in 

25 Example 1 was used to analyse cpieuititatively the fluorescence 
signals. Fluorescence ratio profiles along the chromosomes 
were extracted using WOOLZ software package [developed at HRC, 
Edinburgh, Scotland] as follows: the DAPI image is used to 
set the morphological boundeury of each chromosome by 

30 thresholding. The chromosome outline is smoothed by a n 

number of opening and closing operations, a modified Hilditch 
skeleton is calculated and taken to represent the medial axis 
of the chromosome. The DAPI image is expanded outwards in all 
directions until the intensity field levels off (when 

35 backgroiind is reached) or begins to rise (due to an adjacent 
chromosome) . The intensity profile of each image along the 
medial axis and within the expanded DAPI image is then 
calculated by stunming th green and red f luorescenc pixel 
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values along the sequence of lines perpendicular to and spacec 
at unit distance along the medial axis. Modal green and red 
intensity values corresponding to the expanded DAPI image sire 
taken to represent the background fluorescence and used as the 
5 intensity origin. 



10 



15 



25 



30 



35 



Cejl Upeg; 
5637 



SK-BR-3 



Colo 205 



20 NCI-H508 



SW480 



SW620 



WiDr 



SK-N-MC 



40 CaLu3 



45 CaLu6 



NCI-H69 



50 



Originated from a humam primary bladder 
carcinoma; obtained from ATCC, catalog # 
HTB 9 

Originated from a human metastatic breast 
adenocarcinoma, derived from a pleural 
effusion; obtained from the ATCC, cataloo # 
HTB 30 

Originated from a human colon 
adenocarcinoma; obtained from the ATCC, 
catalog # CCL 222 

Originated from a human cecum 
adenocarcinoma; obtained from the ATCC, 
catalog # cCL 253 

Originated from a hitman colon 
adenocarcinoma; obtained from the ATCC, 
catalog # CCL 228 

Originated from a human lymph node 
metatasis of a colon adenoc^u^cinoma ; 
obtained from the ATCC, catalog # CCL 227 

Originated from a human colon 
adenocarcinoma; obtained from the ATCC, 
catalog # CCL 218 

Originated from a human neuroblastoma 
(metastasis to supra-orbital eurea) ; 
obtained from the ATCC, catalog # HTB 10 

Originated from a human lung 
adenocarcinoma, derived from a pleural 
effusion; obtained from the ATCC, cataloa # 
HTB 55 

Originated from a hxman anaplastic 
carcinoma, probably lung; obtained from the 
ATCC, catalog # HTB 56 

Originated from a human small cell lung 
carcinoma; obtained from the ATCC, catalog 
# HTB 119 



wo 93/18186 



PCr/US93/01775 



- 66 - 



10 



20 



35 



40 



45 



COLO 320HSR - Originated f r m a human c Ion 

adenocarcinoma; obtained from the ATCC 
catalog #220.1 

- Originated from a human breast carcinoma; 

o?!«^r^** ^f^? Helene Smith and Dr. Ling 
Chen [Geraldine Brush Cancer Research 
Center, San Francisco, CA (USA)]. This is 
the same as the 600 MPE cell line described 
in Examples l and 2. 

" Originated from a human breast carcinoma; 
obtained from ATCC, catalog # HTB 19 



The following are five fibroblast cell lines with 
total chromosomal number and X chromosomal number in 
parentheses, which were obtained from the NIGMS repository 
[Camden, NJ (USA)]: 

GM01723 (45, XO) 
2^ GM08399 (46, XX) 

GM04626 (47, XXX) 
GM01415E (48,XXXX) 
GM05009B (49,XXXXX). 

30 Results and Diseuss^nnt 

Demonstrated herein is CGH's capability of detecting 
and mapping relative DHA sequence copy number between genomes. 
A comparison of DNAs from malignant and normal cells permits 
the generation of a "copy number karyotype- for a tumor, 
thereby identifying regions of gain or loss of DNA. 

Demonstrated is the use of dual color fluorescence 
in_aifei hybridization of differently labeled DNAs from a 
subject tumor genome and a normal human genome to a normal 
human metaphase spread to map DNA sequence copy number 
throughout the tumor genome being tested. Regions of gain or 
loss of DNA sequences, such as deletions, duplications or 
amplifications, are seen as changes in the ratio of the 
intensities of the two f luorochromes (used in this 
representative example) along the target chromosomes. 
Analysis of tumor cell lines and primary bladder tumors 
identified 16 different regions f amplification, many in loci 
not previously known to be amplified. Those results are shown 
in Table 3 below. 
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The turn r DNA is detected with the green fluorescing 
FiTC-avidin, and the normal DNA with the red fluorescing 
rhodamine anti-digoxigenin. The relative amoxmts of tumor and 
normal DNA bound at a given chromosomal locus are dependent on 
5 the relative abxindance of those sequences in the two DNA 

samples, and can be quantitated by measurement of the ratio of 
green to red fluorescence. The normal DNA in this example 
serves as a control for local veuriations in the ability to 
hybridize to target chromosomes. Thus, gene amplification or 

10 chromosomal duplication in the tumor DNA produces an elevated 
green-to-red ratio, and deletions or chromosomal loss cause a 
reduced ratio. The Cot-l DNA included in the hybridization 
inhibits binding of the labeled DNAs to the centromeric and 
heterochromatic regions so those regions are excluded from the 

15 analysis. 

The fluorescence signals were quantitatively 
analyzed by means of a digital image analysis system as 
described above. A software program integrated the green and 
red fluorescence intensities in strips orthogonal to the 

20 chromosomal axis, subtracted local background, and calculated 
intensity profiles for both colors and the green-to-red ratio 
along the chromosomes. 

The ability of CGH to quantitate changes in sequence 
copy nximber that affect an entire chromosome was tested with 

25 the above-listed five fibroblast cell lines having 1 to 5 
copies of the X chromosome and two copies of each autosome. 
Hybridization of DNA from the 45, XO cell line (in green) 
together with normal female DNA (in red) resulted in a uniform 
green-red staining of the autosomes whereas the X chromosome 

30 appeared more red. (The reference spread as indicated above 
was of normal male chromosomes. Some faint staining of a 
small part of the Y chromosome was the result of the binding 
of homologous sequences in the pseudo-autosomal region.) 

Hybridizations with DNA from cell lines carrying 2, 

35 3, 4 or 5 copies of the X chromosome resulted in an 

increasingly strong green fluorescence from the X chromosome 
in relation to the autosomes. The average green-to-red 
f luoresc nc ratio of the X chrom some (Figure 7) , when 
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10 



normalized to the average ratio for the autos mes within the 
same metaphase spread, increased linearly with the increasing 
number of X chromosomes [correlation coefficient (r) = 0.978]. 
Thus, CGH can quantitatively distinguish a change of plus or 
minus one copy of a chromosome at least up to 4 copies. 

Experiments showed that CGH could generate a 
complete copy number karyotype for a near-diploid breast 
cancer cell line, 600PE. According to the published karyotype 
for 600PE [Smith et al., JNCI. 78; 611 (1987)], 600PE is 
near-diploid with five marker chromosomes having four copies 
of the q-arm of chromosome 1, monosomy 16, and deletions of 
9p, llq and 17p. CGH using biotinylated 600PE DNA (in green) 
and normal digoxigenin-labeled DKA (in red) revealed the 
following relative copy number changes: gain of iq and loss 
15 of 9p, I6q, I7p and distal Hq. The green-to-red ratio 

profiles for those aberrant chromosomes are shown in Figure 8. 
Only the q-arm of chromosome 16 showed decreased relative copy 
number suggesting that I6p was not deleted. That observation 
was subsequently confirmed by fluorescence in situ 
hybridization (FISH) to 600PE interphase cells using cosaid 
probes for the p- and q-arms of chromosome 16 [I6p and I6q 
cosmid probes provided by Los Alamos National Laboratory, Los 
Alamos, NM (USA)l; two signals per nucleus for the I6p cosmid 
probe and one for the 16q cosmid probe permitted calibration 
of a green-to-red ratio of i.o as indicating two copies of a 
sequence. 

Thus, if the absolute copy number of any point in 
the tumor genome is known, relative copy numbers can be 
converted to actual copy numbers at all loci. The CGH results 
differed from the originally published karyotype in the region 
of I6p and proximal ip. That discrepancy was resolved by 
locus-specific chromosome-specific painting (FISH) that 
indicated that the components of one of the marker chromosomes 
had been misinterpreted by conventional cytogenetic analysis. 
35 CGH with DNAs from two fibroblast cell lines 

CGM05877 and GM01142A from th NIGMS r pository] detected 
small interstitial deletions around the RB-l locus in 13q— 
del(l3) (pter > ql4.1::q21.2 > qter) and del(l3) (pter > 



20 



25 



30 
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ql4.1::q22.1 > qter) . On the basis of the CGH analysis and 
measurement of the deletion size as a fraction of the length 
of chromosome 13 [total length in megabases (Mb)], those 
deletions were estimated to span about 10 and 20 Mb, 
5 respectively. Thus, it is possible that CGH can be used to 
screen DNA samples from solid tumors in order to identify 
large physical deletions that may uncover recessive mutant 
tumor suppressor genes. 

CGH was evaluated for its ability to detect 
10 increased gene copy number with cell lines that contained 
• previously reported amplification of oncogenes. Figure 9A 
shows CGH with DNA from a colon cancer cell line COLO 320HSR, 
known to contain more than a 50-fold amplification of a 300 kb 
region around the myc oncogene [Kinzku et al., pnas (USA), fil: 
15 1031 (1986)]. The expected high green-to-red ratio at 8q24 
corresponding to the location of bXS is clear. The height of 
the peak does not quantitatively reflect the level of 
amplification because the fluorescent signal spread over a 
region of the chromosome that is larger than the length of the 
20 amplicon. That is apparently a result of the complex 

organization of the target DNA in the denatured chromosomes. 

The eight-fold amplification of the erbB 2 oncogene 
in the SK-BR-3 breast cancer cell line also was detectable 
with CGH as a hybridization signal at I7ql2 (Table 3). High 
25 level amplifications such as those also could be detected in 
single color-hybridizations with the use of only labeled tumor 
DNA. 

Cytogenetic and molecular studies of primary tumors 
and cell lines often reveal homogeneously staining regions and 

30 double minute chromosomes that do not involve known oncogenes 
[Saint-Ruf et al., Genes Chrom. caneer. is (1990); 

Bruderlein et al., Genes ehrftTn. Canea^-. 63 (1990)]. CGH 

allows straightforward detection and mapping of such 
sequences. Table 3 contains a summary of the analysis with 

35 CGH of 11 cancer cell lines. Data in Table 3 is based on the 
visual inspection of a large number of metaphase spreads 2uid 
on detailed digital imag analysis of four to six metaphases 
for each seunple. 
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TABLE 3 

Mapping of amplified sequences in established cancer 
cell lines and primary tumors by CGH 



5 


Soecimen 


Orioin 


Amnlif , bv CGH* 


Cytogenetic 
evidence of 
gene annlif 




Cell linAfi! 








5637 


Bladder 


3p25, 6p22 


DM 


15 


SK-BR-3 


Breast 


8q24 (m^), 8q21, 
17ql2 (eEbB2), 
20ql3 






Colo 205 


• Colorectal 


6p21, 6q24 






NCI-H508 


Colorectal 


14ql2-13 


DM 


on 


SW480 


Colorectal 


8q24 fmyc) 


DM 




SW620 


Colorectal 


16q21-23 


HSR 


25 


WiDr 


Colorectal 


8q23-24 fmvc^ 






SK-N-MC 


Neuroblastoma 


8q24 fmyc) 


DM 


30 


CaLu3 


Small cell lung 


8pl2-21, 8qtel, 
17ql2 (SEbB2) 


HSR 


caLue 


Small cell lung 


13q32-34 




35 


NCI-H69 


Small cell lung 


2p24 (N-BKS), 
2p21, 2q21 





Primary tmnnyp. 
tJR140 Bladder carcinoma 

40 DR145 Bladder carcinoma 



16q21-22 
6p22 



* The oncogene most likely involved in this amplification is 
^ shown in parentheses. f •'■*i"«-^«ion is 

45 Cytogenetic information based on the ATCC Catalogue of cell 
Lines & Hybridomas (1992). v^taiogue or cell 

reglontT^^ ""^""^^ chromosomes, HSR = homogeneously staining 

^° Sixteen amplified loci were mapped, many at regions 

of the genome where amplification had not previously been 
suspected. Thus, a large variety of genes may be amplified 
during cancer initiation and progression, in five of the li 
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cell lines, more than one locus was amplified. Two or three 
separate loci on the same chromosome were amplified in four 
cell lines, which suggests a spatial clustering of chromosomal 
locations that undergo DNA amplification (Table 3 and Figure 
5 9B). 

CGH was also applied to identify and map amplified 
DNA sequences in uncultured primary bladder tumors. Of the 
seven tumors tested, two showed evidence of DNA amplification 
but the loci were not the same (Table 3) . Thus, a nvunber of 
10 previously unsuspected genomic regions that might contain 

genes important for cancer progression have been identified by 
CGH. Further studies will elucidate which of those loci 
contain novel oncogenes and which represent coincidental, 
random DNA amplification characteristic of genomic 
15 instability. 

The detection and mapping of unknown eunplif ied 
sequences that typically span several hundred kilobases (kb) 
to a few Mb demonstrated the usefulness of CGH for rapid 
identification of regions of the genome that may contain 
20 oncogenes. Analogously, detection of deletions may facilitate 
identification of regions that contain tumor suppressor genes. 

Further studies are necessary to establish to. what 
extent allelic losses in tumors are caused by physical 
deletions. In clinical specimens, the detection of small copy 
25 number differences is more difficult than with cell lines 
because of the admixture of DNA from contaminating normal 
cells and because of intratumor heterogeneity. As indicated 
above, using PGR to prepare tumor DNA from a small number of 
tumor cells (as a tumor clonal sub-population) may assist in 
30 resolving that problem. Like RFLP, CGH emphasizes the 
detection of aberrations that are homogeneous in a cell 
population and averages those that are heterogeneous. 

At the current stage of development of CGH, 
sensitivity is primarily limited by the granularity of the 
35 hybridization signals in the metaphase chromosomes. Further 
improvements in sensitivity will be achieved by optimization 
of the probe concentration and labeling, and by the av raging 
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Of the green-to-red fluorescence ratios fr m several metaphase 
spreads . 

The descriptions of the foregoing embodiments of the 
invention have been presented for purposes of illustration and 
5 description, 'fhey are not intended to be exhaustive or to 
limit the invention to the precise form disclosed, and 
obviously many modifications and variations are possible in 
light of the above teachings* The embodiments were chosen and 
described in order to best explain the principles of the 
10 invention and its practical application to enable thereby 
• others skilled in the art to best utilize the invention in 
various embodiments and with various modifications as are 
suited to the particular use contemplated. It is intended 
that the scope of the invention be defined by the claims 
15 appended hereto. All references cited herein are incorporated 
by reference. 
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1. A method of comparing copy numbers of different 
DNA sequences in a subject cell or cell population comprising 
the steps of: 

5 a) extracting the DNA from the subject cell or from 

a number of cells of the subject cell population; 

b) amplifying said extracted subject DNA, if 

necessary; 

c) labeling the subject DNA; 

^° ^) hybridizing said labeled subject DNA in situ to 

reference metaphase chromosomes after substantially removing, 
from the labeled subject DNA those repetitive sequences that 
could bind to multiple loci in the reference metaphase 
chromosomes, and/ or after blocking the binding sites for those 

15 repetitive sequences in the reference metaphase chromosomes by 
prehybridization with appropriate blocking nucleic. acids, 
and/or blocking those repetitive sequences in the labeled 
subject DNA by prehybridization with appropriate blocking 
nucleic acid sequences, and/or including such blocking nucleic 

20 acid sequences for said repetitive sequences during said 
hybridization, wherein the DNA sequences in the labeled 
subject DNA that bind to single copy sequences in the 
reference metaphase chromosomes are substantially retained, 
and those single copy DNA sequences as well as their binding 

25 sites in the reference metaphase chromosomes remain 
substantially unblocked both before and during the 
hybridization; 

e) rendering the bound, labeled subject DNA 
sequences visual izjOjle, if necessary; 

^° ^) observing and/ or measuring the intensity of the 

signal from the bound labeled subject DNA sequences as a 
function of position on the reference metaphase chromosomes; 
and 

g) comparing the copy numbers of different DNA 
35 sequences of the subject DNA by comparing the signal 

intensities at dif f rent positions on the reference metaphas 
chromosomes, wherein the greater the signal intensity at a 
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given position,, the greater the copy nxmber of the sequences 
in the sxibject DNA that bind at that position. 

2. A method according to Claim l wherein the copy 
5 number of a subject DNA sequence tslnding at one position in 

the reference metaphase chromosomes relative to the copy 
number of a sequence binding at another position is quantified 
by measuring the ratio of the signal intensities at the two 
locations • 

10 

3. A method according to Claim 1 further comprising 
the addition of an unlabeled nucleic acid to said 
hybridization mixture wherein said unlabeled nucleic acid has 
a sufficient number of nucleic acid sequences substantially 

15 complementary to the sequences in the reference metaphase 

chromosomes to prevent saturation of the binding sites in the 
reference metaphase chromosomes by the labeled subject DNA. 

4. A method according to Claim 1 wherein the 

20 reference metaphase chromosomes are human and prehybridized 
with human genomic DNA or human genomic DNA enriched in high 
copy repetitive sequences, and wherein human genomic DNA 
and/ or human genomic DNA enriched in high copy repetitive 
sequences are included in the hybridization. 

25 

5. A method according to Claim 1 vrtierein in step 
(f) and/ or (g) an image analysis system is used. 

6. A method according to Claim 1 wherein signal 
30 intensities significantly greater than average indicate the 
presence and locations on the reference metaphase chromosomes 
of sequences that are duplicated or amplified in the subject 
cell or cell population whereas signal intensities that are 
significantly less than average indicate the presence and 
35 locations of sequences that are deleted in the subject cell or 
cell population. 
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7. A method according to Claim i wherein the 
reference metaphase chromosomes are from an antenna cell line. 

8. A method of comparing copy numbers of different 
5 RNA sequences in a subject cell or cell population comprising 

the steps of: 

a) extracting the RNA from the subject cell or from 
a number of cells of the subject cell population; 

b) amplifying said extracted subject RNA, if 

10 necessary; 

c) labeling the subject RNA; 

d) hybridizing said labeled subject RNA in situ to 
reference metaphase chromosomes after substantially removing 
from the labeled subject RNA those repetitive sequences that 

15 could bind to multiple loci in the reference metaphase 

chromosomes, and/ or after blocking the binding sites for those 
repetitive sequences in the reference metaphase chromosomes by 
prehybridization with appropriate blocking nucleic acids, 
and/ or blocking those repetitive sequences in the labeled 

20 subject RNA by prehybridization with appropriate blocking 

nucleic acid sequences, and/or including such blocking nucleic 
acid sequences for said repetitive sequences during said 
hybridization, wherein the RNA sequences in the labeled 
sxibject RNA that bind to single copy sequences in the 

25 reference metaphase chromosomes are substantially retained, 

and those RNA sequences as well as their binding sites in the 
reference metaphase chromosomes remain substantially unblocked 
both before and during the hybridization; 

e) rendering the bound, leUDeled subject RNA 
30 sequences visualizable, if necesscury; 

f ) observing and/or measuring the intensities of the 
signals from the bound labeled subject RNA sequences as a 
function of position on the reference metaphase chromosomes; 
and 

35 g) comparing th copy numbers of different RNA 

sequences of the subject RNA by comparing the signal 
intensities at differ nt positions on the refer nee metaphase 
chromosomes, wherein th greater the signal intensity at a 
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given position^ the greater the copy niimber of the sequences 
in the sxibject RNA that bind at that position. 

9. A method of con5>aring copy nmnbers of different 
5 DNA sequences in one subject cell or cell population relative 
to copy numbers of sxibstantially identical sequences in 
another cell or cell population, said method comprising the 
steps of: 

a) extracting the DNA from both of the subject cells 
10 or cell populations; 

b) amplifying said extracted sxibject DMAs, if 

necessary; 

c) labeling each of the subject DNAs differently; 

d) hybridizing said differently labeled stabject DNAs 
15 in situ to reference metaphase chromosomes after substantially 

removing from the labeled DNAs those repetitive sequences that 
coTild bind to multiple loci in the reference metaphase 
chromosomes, and/ or after blocking the binding sites for those 
repetitive sequences in the reference metaphase chromosomes by 

20 prehybridization with appropriate blocking nucleic acids, 

and/ or blocking those repetitive sequences in the labeled DNA 
by prehybridization with appropriate blocking nucleic acid 
sequences, and/ or including such blocking nucleic acid 
sequences for said repetitive sequences during said 

25 hybridization; 

e) rendering the bound, differently labeled DNA 
sequences visualizable, if necessary; 

f ) observing and/or measuring the intensities of the 
signals from each subject DNA, and the relative intensities, 

30 as a function of position along the reference metaphase 
chromosomes ; cmd 

g) comparing the relative intensities among 
different locations along the reference metaphase chromosomes 
wherein the greater the intensity of the signal at a location 

35 due to on subject DNA relative to the intensity of the signal 
due to the other subject DNA at that location, the greater the 
copy number of the sequence that binds at that location in the 
first subject cell or cell population relative to the copy 
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number of the substantially identical sequence in the second 
subject cell or cell population that binds at that location. 

10. A method of quantitatively comparing copy 
5 numbers of different DNA sequences in one subject cell or cell 
population relative to copy numbers of substantially identical 
sequences in another subject cell or cell population, said 
method comprising steps (a) through (e) of claim 9 and steps 
of: 

measuring the intensities of the signals from 
each of the bound subject ONAs and calculating the ratio of 
the intensities as a function of position along the reference 
metaphase chromosomes to form a ratio profile; and 

g. quantitatively comparing the ratio profile among 

15 different locations along the reference metaphase chromosomes, 
said ratio profile at each location being proportional to the 
ratio of the copy number of the DNA sequence that binds to 
that location in the first subject cell or cell population to 
the copy number of a substantially identical sequence in the 

10 second cell or cell population. 



25 



30 



35 



11. A method to determine the ratio of copy numbers 
of different DNA sequences in one subject cell or cell 
population to copy numbers of substantially identical 
sequences in another cell or cell population, said method 
comprising the steps of (a) through (f) of Claim 10 and steps 



of: 



g. determining the average copy number of a 
calibration sequence in both subject cells or cell 
populations, said calibration sequence being substantially 
identical to a single copy sequence in the reference metaphase 
cells; and 

h. normalizing the ratio profile calculated in (f) 
so that at the calibration position, the ratio profile is 
equal to th ratio of the averag copy numbers det rmined in" 
(g) , the normalized ratio profile at any other location along 
the reference metaphase chromosomes thereby giving the ratio 
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of the copy numbers of the DNA secjuences in the two subject 
DNAs that would bind at that location. 

12. The method according to Claim 11 further 
5 comprising determining the ratio of copy numbers of DNA 

sequences in more than two subject DNAs wherein the comparing 
is done pairwise between signals from each subject DNA. 

13. The method according to Claim 11 ftirther 
10 comprising determining the copy numbers for more than one 

calibration position in step (g) ; and doing the normalizing in 
step (h) to obtain the best fit of the ratio profile to the 
calibration positions. 

15 14. A method for comparing copy numbers of 

different DNA sequences in a test cell or cell population r 
said method comprising applying steps (a) through (f ) of Claim 

9 wherein one of the subject cells or cell populations is the 
test cell or cell population and the other is a normal cell or 

20 cell population, and step (g) of comparing the relative 
intensities eunong different locations along the reference 
metaphase chromosomes, wherein the greater the relative 
intensity at a location, the greater the copy number of the 
sequence in the test cell or cell population that binds to 

25 that location, except for sex chromosomes where the compeurison 
needs to take into account the known differences in copy 
numbers of sequences in the sex chromosome in relation to 
those on the autosomes in the normal subject cell or cell 
population. 

30 

15. A method for compzuring the copy number of 
different DNA sequences in a test cell or cell population, 
said method comprising applying steps (a) through (f ) of Claim 

10 wherein one of the sxabject cells or cell populations is the 
35 test cell or c 11 population, and the other is a standard c 11 

or cell population wherein the known copy numbers of the DNA 
sequ nces that bind to different positions on the reference 
metaphase chromosomes is known, emd the following steps: 
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g. adjusting the ratio profile at each location 
along the reference metaphase chromosomes by multiplying the 
ratio profile by the known copy number of DNA sequences in the 
standard cell or cell population that bind there; and 
5 h. comparing the adjusted ratio profiles at 

different locations along the reference metaphase chromosomes 
wherein the greater the adjusted ratio profile at a location, 
the greater the copy number of the DNA sequence in the test 
cell or cell population that binds there. 

10 

16. A method of determining the ratios of the copy 
numbers of different DNA sequences in a test cell or cell 
population, said method comprising applying steps (a) through 
(g) of Claim 15 and calculating the ratio of the copy number 

15 of a DNA sequence in the test cell or cell population that 
binds to one location on the reference metaphase chromosomes 
to the copy number of a sequence that binds to another 
location by dividing the adjusted ratio profile at the 
location of the first sequence by that at the location of the 

20 second. 

17. A method for determining the copy number of 
different DNA sequences in a test cell or cell population, 
said method comprising 

25 applying steps (a) through (g) of Claim 15; 

h. determining the copy number of a calibration 
sequence in the test cell or cell population that is 
substantially identical to a single copy sequence in the 
reference cells; and 

30 i. normalizing the adjiisted ratio profile so that 

at the location of the calibration sequence on the reference 
metaphase chromosomes, the normalized, adjusted ratio profile 
is equal to the copy number of the calibration sequence 
determined in (h) , the value of the normalized, adjusted ratio 

35 profile at another location then being equal to th copy 

number of the DNA s qu nee in the t st cell or cell population 
that binds at that location. 
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18. The method of Claim 17 wherein two or more 
calibration sequences are used, and the adjusted ratio profile 
is normalized to get the best fit to the copy numbers of the 
ensemble of calibration sequences. 

19. A method of comparing copy numbers of different 
RNA sequences in one subject cell or cell population relative 
to copy numbers of substantially identical sequences in 
another cell or cell population, said method comprising the 
steps of: 

a) extracting the RNA from both of the subject cells 
or cell populations; 

b) amplifying said extracted subject RNAs, if 

necessary; 

c) labeling each of the subject RNAs differently; 

d) hybridizing said differentially labeled subject 
RNAs i,n ^iPi to reference metaphase chromosomes after 
substantially removing from the labeled RNAs those repetitive 
sequences that could bind to multiple loci in the reference 
metaphase chromosomes, and/or after blocking the binding sites 
for those repetitive sequences in the reference metaphase 
chromosomes by prehybridization with appropriate blocking 
nucleic acids, and/or blocking those repetitive sequences in 
the labeled RNA by prehybridization with appropriate blocking 
nucleic acid sequences, and/or including such blocking nucleic 
acid sequences for said repetitive sequences during said 
hybridization; 

e) rendering the bound, differently labeled RNA 
sequences visualizable, if necessary; 

f ) observing and/or measuring the intensities of the 
signals from each subject RNA, and the relative intensities as 
a function of position along the reference metaphase 
chromosomes ; and 

g) comparing the relative intensities among 
different locations along the reference metaphase chromosomes 
wherein the greater the intensity of the signal at a location 
due to one subject RNA relative to the intensity of the signal 
due to the oth r subject RNA at that location, the greater th 
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5 



10 



copy number of the sequence that binds at that location i„ the 
first subject cell a cell population relative to the copy 
nmober of the substantially identical sequence in the second 
subject cell or cell population. 

20. A method of detecting amplification of a 
certain sequence or group of sequences in a subject cell or 
cell population, comprising steps (a) through (e) of claim i 
Wherein the is_sita hybridization is targeted to antenna cells 
in which the DNA sequence{s) to be tested for is or are 
amplified, and examining the reference cell for regions that 
are hybridized significantly more intensely than others, the 
presence of such regions indicating amplifications of the 
sequence (s) which are being tested. 
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